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(57) Abstract 



Bacterial adhesins that have been selected or recombined to have the ability to bind specifically to pre-determined. selected inanimate 
or animate receptors and the use of such adhesins or bacteria expressing the adhesins. in the targeting of useful compounds and/or bacteria 
to selected cells and surfaces. 
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RECEPTOR SPECIFIC BACTERIAL ADHESINS AND THEIR USE 
FIELD OF INVENTION 

The present invention pertains to naturally occurring bacte- 
rial adhesins and derivatives and variants hereof, having the 
ability to bind to pre -determined, specifically selected 
receptors, and to the use of such adhesins in the targeting 
of active compounds and microbial cells to locations compris- 
ing such selected receptors. 

This invention was supported in part by the US National 
10 Institute of Health (NIK), under grant #DE07218, and the US 
Veterans Administration. The US government has certain rights 
in the invention. 
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TECHNICAL BACKGROUND AND PRIOR ART 

The ability to adhere or bind specifically to, and in many 
15 instances, to colonize an animate or inanimate surface is of 
paramount importance in microbial ecology and pathogenesis. 
Such specific receptor binding is provided by microbial 
adhesins which play a key role in bacterial /host and 
viral /host recognition and interaction and for the recogni- 
tion of any specific surface by a microorganism. 



20 



Accordingly, adhesion of bacteria to host surfaces is common- 
ly regarded as an essential step enabling bacteria to become 
established as members of the normal flora of host organisms 
or to cause an infection (refs. 7, 18). Bacterial lectins are 
25 the most common and most thoroughly studied type of adhesins 
among both gram negative and gram positive bacteria (ref . 
40) . Evolutionary pressures have selected lectins for adhes- 
ive functions probably due to the abundance of glycoconjugat- 
es on animate and inanimate surfaces. One class of structures 
that a large range of gram-positive and gram-positive 
bacteria including Escherichia, coli and other members of the 
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family ED«r=t«c«riaceae, have evolved to adhere to host 
glycoproteins in a saccharide-dependent n,anner are surface 

atxon Factor Antigen (CPA) type I and Colonization Factor 
5 Antigen ,CPA, type II are specific examples of such fin^riae 

By far the „„>st con^on of the enterobacterial finiriae is 

IT l:Jl\T'T""^°^'"' "^«^- 
10 ' fimbriae are heteropolymers of four different 

10 subunits (refs :>ft c , ^-i-j-i-exeac 

of a 17 vdJ ! ^"'^ £in*ria. about 1000 copies 

Pill, \T r"""^ «-"-ral subunit designated Fi^ (or 
r hoJ^o! ^"f^"'^"^'' - right-handed helix surrounding 

a hollow axial core (ref . li, . Three ancillary subunits 
FimF, FimG and FimH are _ i ""unius, 

LS - polymerized into the fimbrial 

L5 structure, but comprise onlv l-3» of n,- . ■ w • , 

20, 24, 27, 32) fMibrial mass (refs. 

The 28 IcDa FimH subunit has been shown by several direct and 
indirect tests to be the actual fi^rial lectin (refs 2, t 

0 TfLlt'/l' ■ ■ -""-ah its function :„y b^ 

affected by other subunits (ref. 55,. The PimA subunit is 
highly variable, but the Fi,^ subunit is highly consel^ed 

_ rec tors have beeTsL^ l^T^;- 

key role xn the infectious process (refs. 2, 4, 9 X9 25 
26, 31, 33, 44, 50) . ' ' 

Detailed analysis of adhesion- inhibition or agglutination 

n dxrect adhesxon studies using "neoglycolipids" 
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as receptors (refs. 37, 47). The combining site of the Kleb- 
siella pneumoniae type 1 acihesin was shown to be similar to 
the Escherichia coli adhesin, whereas the Salmonella typhimu- 
rium type 1 adhesin combining site appears to be smaller and 
devoid of a hydrophobic region (ref. 16) Thus, it has long 
been thought that type 1 fimbriae of enterobacteria were 
functionally quite similar and that the primary essential 
characteristic of any potential receptor was the presence of 
terminal al- 3 -linked mannosyl residues. 

Recently it has been reported that the type i fimbriated, K- 
12-derived E. coli strain CSH-50 exhibits mannose- sensitive 
peptide -binding activity (ref. 51). CSH-50 E. coli bound to 
yeast mannan (Mn) , a highly mannosylated glycoprotein, and to 
human plasma fibronectin (Fn) when immobilized on assay 
wells. Adhesion to Mn, but not to Fn, was essentially elimin- 
ated by periodate treatment. Furthermore, CSH-50 E. coli 
adhered in a mannose -sensitive fashion to non- glycosylated 
peptide fragments of Fn and to a synthetic peptide copying 
the first 30 residues of the Pn molecule, FnSpi. Fimbriae 
purified from these organisms also bound to Fn and FnSpl A 
well -characterized recombinant strain of e. coli pc31 expres- 
sing type 1 fimbriae, HBlOl (pPKL4) , adhered to Mn, but did 
not adhere to the other substrata. Fimbriae purified from 
HB101(pPKL4) did not adhere to Fn or FnSpl . Thus, E. coli 
type 1 fimbriae appeared to be functionally heterogeneous. 

Several E. coli isolates obtained from human urine also 
expressed peptide -binding activity similar to that of CSH-50 
indicating that this new phenotype was not restricted to a 
laboratory strain. Other isolates expressed an adhesive 
activity similar to that of HBlOl(pPKL4) . a third class of 
type 1 fimbriae-mediated adhesive phenotype was also observed 
among these isolates. 

The FimH subunit is the D -mannose -sensitive adhesin of type 1 
fimbriae, common i.a. to the Enterobacteriaceae . It is pres- 
ently widely accepted that host receptors are strictly 
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limited to glycoproteins containing terminal mannosyl resi- 
dues (refs. 16, 37, 41, 42, 43, 47). Hereinbelow functional 
and genetic evidence is provided demonstrating that this 
generalization is not correct. Allelic variants of E. coli 
fimH genes encoding proteins differing by as little as a 
single amino acid substitution confer distinct adhesive 
phenotypes and accordingly, the fimH gene is not a single 
gene but rather a family of fimH genes . 

surprisingly, active receptors for FimH proteins were found 
to include glycoprotein domains where mannosyl residues are 
not terminal and even protein domains devoid of saccharide 
Thxs unexpected adhesive diversity within the fimH family 
broadens the scope of potential receptors for bacterial 
adhesion and may lead to a fundamental change in the under- 
standing of the role(s) type l fimbriae and other bacterial 
adhesins may play in bacterial ecology or pathogenesis. 

The present findings also opens up a completely new field of 
technology, since it provides the means to design bacteria 
expressing adhesins that bind to pre -determined, specific 
20 receptors in a wide range of anirrate and inanimate locations. 

TecLT ^° ^--n as Designer Adhesin 

Technology, 
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SUMMARY OF THE INVENTION 



Accordingly, the present invention relates in one aspect to a 
recombinant or .utant bacterial adhesin variant derived from 

^'^^si.n, said adhesin variant having 
altered bxnding properties relative to the naturally occur- 
ring adhesin from which it is derived. 

in further aspects the invention provides a FimH adhesin 

PC31 FxmH adhesxn by at least one amino acid and a recombi- 
nant replxcon comprising a DNA sequence selected from the 
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group consisting of a sequence coding for a recombinant 
bacterial adhesin variant as defined above and a sequence 
coding for a FimH adhesin as also defined above. 

In a still further aspect, there is provided a fusion protein 
5 comprising an adhesin selected from the group consisting of a 
recombinant bacterial adhesin variant as defined above and a 
FimH adhesin as also defined above, and a heterologous 
polypeptide. 

The invention also pertains to a recombinant bacterial cell 
10 which expresses an adhesin selected from the group consisting 
of a recombinant bacterial adhesin variant as defined above 
and a FimH adhesin as defined above, and to a composition 
comprising a population of such cells. 
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in one interesting aspect of the invention there is proT>ided 
a method of isolating a bacterial cell expressing an adhesin 
having modified binding properties relative to a natively 
expressed adhesin, comprising identifying in the bacterial 
cell DNA sequence (s) coding for the binding domain (s) of said 
natively expressed adhesin and substituting at least one 
codon herein, whereby a modified adhesin molecule is 
expressed that is different in at least one amino acid from 
the adhesin expressed natively, and selecting a bacterial 
cell expressing the modified adhesin having an altered 
adhesion phenotype relative the natively expressed bacterial 



25 adhesin. 



In a further interesting aspect the invention relates to a 
method of preparing a recombinant bacterial cell that binds 
to a specific receptor moiety, comprising introducing into a 
bacterium that does not produce an adhesin binding to said 
receptor moiety, a DNA sequence coding for an adhesin binding 
to the receptor moiety, and selecting a bacterial cell ex- 
pressing the DNA sequence. 
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I*ere is also provided a method of targeting a bacterial 
adhesxn to a specific location, ccnprising ,i, Identifying in 
saxd location an adhesin- interacting receptor noiety which is 
recognizable by bacterial adhesins, said moiety preferably 
bexng one which is occurring abundantly, ,ii, isolating a 
bacterial cell that grows in said location and expresses an 
ITII interacting with said receptor moie- 

ty, and administering to the location the bacterial cell or 
the adhesin under conditions where the adhesin and the 

the'ftoL""'"'' interacting contact whereby 

the adhesin is associated with the receptor moiety 



DETAILED DISCLOSDEE OF THE INVEKTION 



^ used herein the. te™ "bacterial adhesins" denotes proteins 
Which recognize and bind to a large variety of target mole 
IS cules such as polysaccharides, glycolipids! glycoproteris 
polypeptides and proteins. More than a hundred different 
adhesins have been deacrlh=,i <, . . ^'■merent 

variety of a.,J '^««='^^'«<1 so far originating from a large 
olTlT° '"^-"^3"^-= gram-positive bacteria. Adhesins 
20 T "^"^ bacterial surface as components of 

20 -sanelles such as fimbriae, also called plli or frbrillae 
these three terms being used interchangeably herein or as' 
non^fi^rial or afimbrlal adhesins ,ref. e.l HxII^ies of 
Ein^rial or pill adhesins include the following surface 
structures m E. coll: P pui. , ^"^^^ 

Zl'- =^31 A,, i! C;ieiL 

pneumoniae., type 3 pin, ^^r,,etsll. pertussis. ^^,T 

p:i:L^*^pHT:n:'^ — ocontlca: Myf fibrlllae; wLLnla 
pestis. PH6 antigen and Fl envelope antigen. 

Exanples of non-fimbrial c.i i 
JO adhesin function or which^^ structures which have 

function include cipsules "Lo3"" T""" ^ 
nembrane proteins ml ZL'IT 

3, NPA.4 Arnr^: H <°°n-£i"brial adhesin)-!, NFA.2, nfa- 

FA 4, AFA (afimbrial adhesins). I, afa-h and AFA-lli 
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In the present context, the term "fimbriae" designates long 
thread-like bacterial surface organelles. Fimbriae are hete- 
ropolymers each consisting of about 1000 structural compo- 
nents, mostly of a single protein species. However, in many 
cases a few percent minor components are also present. Adhes- 
ins can either be identical to the major structural protein 
as in Escherichia coli K88 and CFAl fimbriae and type 4 
fimbriae of Pseudomonas , Vibrio and Neisseria, or they may be 
present as minor components as in E. coli type 1 and P 
fimbriae [for reviews see Krogfelt 1991 (ref. 62) ; Kaufman 
and Taylor, 1994 (ref. 60): Kuehn et al . , 1994 (ref. 63); 
Klemm and Krogfelt, 1994 (ref. 61)]. In the latter case/i. 
when present as minor compounds, the adhesins are closely 
related in amino acid sequence to the major fimbrial compo- 
nent. As used herein the term bacterial adhesin will also 
include adhesins isolated from non-bacterial sources includ 
ing viruses, and which are expressed in a bacterium. 



e . 



In the following, the FimH adhesin of type 1 fimbriae will be 
described structurally and functionally as a representative 
example of a bacterial adhesin. 



FxmH is located at the tip of the type 1 fimbriae and also 
intercalated at inter^ls in the fimbrial organelle. Most 
forms Of the FimH adhesin target to (bind to) oligosaccharide 
structures containing terminally located a-D-mannoside resi- 

25 dues [Krogfelt et al . , 1990 fref -jq^ i ia=e.«^ 

J.., j.:y»u trer. 29) j. Based on studies with 

various D-mannose derivatives the receptor binding site of 
the FimH adhesin is assumed to be shaped like an elongated 
pocket large enough to accommodate a triaaccharide motif 
[Sharon, 1987 (ref. 65)]. 



The f^nai gene encodes the precursor FimH protein of 300 amino 
acids [Klemm and Christiansen, 1987 (ref. 27)]. This precur- 
sor is processed into a mature form of 279 amino acids The 
amino acid sequence of the E. coli PC31 FimH protein is shown 
xn Table 1 below wherein cysteine residues are indicated by 
asterixes, the signal peptide is outlined in bold letters 
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and two regions contributing to the binding site are under- 
lined (SEQ ID NO:l). (it should be noted that residue 176 is 
a proline residue and not as previously indicated when the 
PC31 FimH protein was first published, an arginine residue) • 



FimH protein 



20 



25 



-21 ^ * 

MKRVITLPAVLLMGWSVNAWSFACK'] 



^° 'rQ^FCHNDYPKTTTT)y VTT,01?r,SAYr^ VT..QMPCr.>mn.^eo ..^rpp^ 
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:TANGTAIPIGGQSANVYVNIAPWNVG ONLWDr.g 

'SETPRWYNSRT 

DKPWPVALYLTPVSSAGS3£aiKAGSLIAVLII^QTNNYNSDDFQFVWNIYA^ 
GCDVSARDVTVTLPDYPGSVPIPLTVYCAKSQNLGyYLSGTHADAGNSIFT^^ 

279 

LGAVGTSAVSLGLTAHVARTGGQVTAGNVQSIIGVTFVYO 



15 GVGVQLTRNGTIIPANNTVS' " 



The Fin« contains 4 cysteine residues assumed to direct 
folding Of the n,olecule into distinct functional dontains For 
co-nparxson Pi^ and the .ninor co-rponents Fi^ and Fi^ only 
have two cysteine residues. The localization of the cysteiL 
residues in Fi», points to a tandem arrangement of twl ^ces 
tral genes. Furthermore, similar a.i„o acids can he four" 
imxlar positions in the two halves of the FimH proteir T^e 
■-dway. point is located roughly around residue ISO n't^ 
-ture protein. The two halves or domains of FimH have 
evolved differently with the »- terminal section becoming the 
domain harbouring the receptor binding site, whereas III c 
terminal sector became the domain of the molecule retired 
for integration into the fimbrial organelle structurf I t 
having the features of a structural co,,^onent. 

in-frame linker insertions into the fimH gene confirms this 
model Of the FimH protein. Thus insertions in the C^HrT", 
half of the molecule generallv rt., • I C- terminal 

1= generally do not interfere with the 
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receptor- binding ability whereas abolishment of receptor 
binding ability following linker insertion in the N- terminal 
is the rule (Klemm et al . , unpublished data), a similar 
domain structure has been observed in the PapG adhesin.of p- 
fimbriae [Hultgren et al., 1989 (ref. 59); Kuehn et al., 1994 
(ref . 63) ] . 

in accordance with the invention, the recombinant bacterial 
adhesin as defined above is one which is derived from an 
adhesin having certain binding properties, but which recombi- 
nant bacterial adhesin has altered binding properties rela- 
tive to the naturally occurring adhesin (the parent adhesin) 
from which it is derived. As used herein this feature 
encompasses situations where the adhesin variant recognizes 
and binds to receptor moieties not being recognized by the 
parent adhesin irrespective of whether the adhesin variant 
has lost its normal ability to recognize and bind to a cer- 
tain receptor moiety or certain receptor moieties, or not. 

As used herein the term "binding" indicates that the adhesin 
has a degree of affinity to the receptor moiety which enables 
it, when brought into contact herewith, to interact in a 
binding manner with this moiety whereby an adhesin- receptor 
moiety association occurs. The strength of this binding 
depends on the type of binding force which causes the inter- 
action between the receptor moiety and the adhesin. In the 
present context, such binding forces include covalent binding 
and binding by non- covalent binding forces including hydrogen 
bonds, hydrophobic interactions, van der Waal forces and 
ionic interactions. Accordingly, the term "receptor moiety" 
as used herein encompasses any moiety to which an adhesin may 
30 interact by the above binding forces. 

In one specific embodiment, the adhesin variant is a FimH 
mannose-sensitive adhesin normally binding to a receptor 
selected from a domain where mannosyl residues are not ter- 
minal and a domain devoid of saccharide and having an amino 
35 acid sequence which differs from the E. coli PC31 PimH adhes- 
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in by at least one amino acid residue substitution, including 
an amino acid sequence differing by at least 2 amino acids, 
preferably by at least 3 amino acids, more preferably by at 
least 4 amino acids, most preferably by at least 5 amino 
acids. In further useful embodiments, the amino acid sequence 
may even differ by more than 5 amino acids such as at least 
6, preferably by at least 7, more preferably by at least 8 
even more preferably by at least 9 and in particular by at' 
least 10 amino acid residues, such as by at least 12 amino 
acids including by at least 15. 

Accordingly, the above FimH adhesin variant is preferably at 
least 90% homologous to the PC31 FimH adhesin, such as at 
least 92% homologous, more preferably at least 93% 
homologous, even more preferably at least 94% homologous, 
most preferably at least 95% homologous, and in particular at 
least 96% homologous, e.g. at least 97% homologous. In par- 
ticularly interesting embodiments, the adhesin is at least 
9 8% homologous, including at least 99% homologous such as at 
least 99.5% homologous. 

The above FimH adhesin variant can be a chimeric adhesin 
comprising amino acid sequences from different FimH adhesins 
having identical or different binding specificities. 

AS it has been mentioned above, the present invention is 
generally aimed at providing the means to design bacterial 
adhesins having specific binding properties whereby bacteria 
expressing the adhesin variants or the adhesin variants in 
xsolated or purified form can be designed to bind to a speci- 
fxc desired target receptor moiety. Accordingly, the adhesin 
variant may in accordance with the invention be an adhesin 
variant as defined above which binds to an animate receptor 
moiety. Such receptors include receptors located on inner 
surfaces of humans and animals, such as e.g. the mucosal 
membranes of the gastrointestinal tract including the teeth 
and the oral cavity, and the mucosal membranes of the respir 
atory and the genitourinary systems. Included are also 



wo 95/20657 '^^^ PCT/DK^5/00042 

11 

adhesin variants that bind to outer surfaces, including the 
skin, of huinans and animals. 

In a further embodiment, the adhesin variant is designed so^ 
as to acquire the ability to bind to a plant receptor moiety. 
5 This aspect is of particular interest in relation to deliber- 
ate release to out -door or in-door environments where plants 
are cultivated, of useful recombinant bacteria having a 
desirable effect on the growth and yield of the plants. 
Such desirable bacteria are e.g. bacteria expressing a pesti- 
10 cidally active substance, i.e. a biopesticide including as 

examples pesticidal toxins produced naturally by BslcIHus spp 
such as the Bacillus thuringiensis (Bt) toxin. In this con- 
text, another example is bacteria which protect plants 
against low temperature damages or bacteria which express 
gene products protecting plants against detrimental effects 
of herbicides. 



15 



20 



By providing such bacteria with genes expressing adhesin 
variants which e.g. bind specifically to certain plant spe- 
cies and/or to certain locations on the plant, these useful 
bacteria will, when administered to the plant growing envi- 
ronment, be selectively associated with the target plant 
species or a specific target area of the plant. It may thus 
be desirable to have these useful bacteria administered to 
the leaves of the plants or to have the root system colonized 
25 herewith. 



Accordingly, the present invention encompasses adhesin vari- 
ants as defined herein which bind selectively or specifically 
to a phylloplane receptor moiety or which bind to receptors 
on plant roots. Similarly, adhesin variants can be provided 
30 which are targeted to the stem or the flowers of the plants. 

As it is mentioned above, bacterial adhesins include adhesins 
having an inherent capability to bind or interact with inani- 
mate surfaces carrying receptor moieties with which the 
adhesin can interact to become bound to the surfaces. It is 
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Jcnown that certain bacterial adhesins can bind to inanimate 
surfaces including as examples glass, hydroxyapatite (a tooth 
enamel model compound) or polymer structures including plas- 
tics and polysilicates. The present invention has made it 
possible to design bacteria which bind selectively to any 
inanimate surface which carries a receptor moiety for which 
an adhesin variant binding thereto may be constructed. Accor- 
dingly, the present invention also provides an adhesin vari- 
ant as defined herein which binds to an inanimate receptor 
moiety. Such adhesin variants are particularly interesting in 
connection with the concept of bioremediation, i.e. a tech- 
nology designed to enhance degradation of chemical pollutants 
in the environment. It is clearly a significant improvement 
of this technology to have at hand bacteria which comprise 
genes coding for pollutant -degrading gene products and which 
also express adhesins targeting the bacteria selectively to 
the environment where the pollutants are present, e.g soil, 
aquatic environments and drinking water supply systems. 
Furthermore, adhesin variants capable of binding to tooth 
enamel are useful in the protection of teeth against caries. 

In a further embodiment, there is in accordance with the 
invention provided an adhesin variant which is part of a 
fusion protein comprising the adhesin variant and a non- 
adhesin. heterologous polypeptide. Using the FimH as an 
example, it has been found that fusions between a bacterial 
adhesin and other proteins can be made whereby the resulting 
fusion proteins are inserted into the cell surface organelle 
Of whxch the adhesin is a structural part. These resulting 
hybrid adhesin- carrying cell organelles remain fully func- 
tional with respect to binding properties. Additionally it 
has been found that large regions of non- adhesin proteins 
e.g. regions comprising in the range of 1 to 100 amino acids 
including a range of 5 to 75 amino acids and a range of 10 to 
60 amino acids, such as regions comprising 15 to 54 amino 
acids, can be inserted into type 1 fimbriae without impairing 
the binding properties of the fimbriae. 
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In useful embodiments of the invention, the non-adhesin 
region of a fusion protein comprising an adhesin variant as 
defined herein include a heterologous polypeptide which is 
selected from an epitope, an enzyme, a toxic gene product and 
an antibody. 

It has significantly been found that, when fusion proteins 
are expressed in which the heterologous polypeptide is an ■ 
epitope or an epitope- carrying domain forming an integrated 
part of the fusion protein, and thus presented on the surface 
of the host cell expressing the fusion protein, the epitope- 
carrying polypeptide can be presented in a conformational 
form similar to its natural conformation. 

Furthermore, it has surprisingly been found that the above 
fusion proteins can be overproduced by the bacteria compris- 
ing hybrid genes coding for fusion proteins, resulting in 
excretion of the fusion proteins to the growth medium in 
large quantities. Accordingly, the excreted fusion proteins 
are then readily isolated and purified, e.g. by means of 
affinity chromatography. These findings provide the means to 
manufacture bacterial cells having on their surface hybrid 
adhesin- carrying cell organelles as well as to produce large 
quantities of excreted fusion proteins, both of which can be 
targeted to specific surfaces as determined by the binding 
properties of the adhesin variant of the fusion protein. 

The above technology of making adhesin variant - fusion pro- 
teins is useful for a range of industrially important pur- 
poses such as: 



(i) development of live vaccines targeted to specific cellu- 
lar surfaces; 

(ii) development of subunit vaccines for administration 
orally or by injection, which are targeted to pre-determined, 
specifically selected cell surfaces or mucosal surfaces; 
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(iii) development of fusion proteins combining specific 
binding properties with specific enzymatic or toxin activ- 
ities. Such fusion proteins have applications as therapeuti- 
cal or diagnostical agents, including use in biosensors; 

(iv) use of fusion proteins as carriers of non-covalently 
linked chemical moieties whereby the adhesin part of the 
protein is used to target the chemical moiety to specific 
locations and the non-adhesin part carries and then releases 
the moiety when the fusion protein has reached its target 
Examples of chemical entities which may be linked to the 
fusion protein include imaging agents and pharmacologically 
active components. Examples of applications for this use 
xnclude imaging of atherosclerotic plaques or tumor tissues, 
and delivery of chemical agents at specific locations in or 
on microbial, human, animal or plant cells including specific 
tissues or tissue components; 

(V) development of fusion proteins which are useful in affin- 
ity purification processes. 

It has been found that the fimH gene coding for the E. coli 
FimH adhesin is not a single gene but rather a family of fimH 
genes, and accordingly it has now been established that 
allelic variants of E. coli f±n^ genes exist that encode 
adhesin proteins which, relative to the known E. coli PC3l 
f.rnH gene product differ by as little as a single amino acid 
substitution and confer distinct binding or adhesive 
phenotypes , 

Accordingly, as it has been mentioned above, the present 
invention relates in a further aspect to a PimH adhesin 
having an amino acid sequence which differs from the E. coli 
PC31 FimH adhesin as defined above by substitution of at 
least one amino acid, it will be understood that such an 
adhesin encompasses naturally occurring adhesins as well as 
adhesins which are encoded by recombinant or mutant fi^ 
genes, m this context the term "fimH gene" denotes a gene 
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coding for a gene product which can be integrated into a type 
1 fimbria and which confers to the fimbria the eibility to 
recognize and bind to a receptor. 

The FimH adhesin as defined above may be an adhesin having 
5 its inherent binding properties or an adhesin variant which 
in relation to an adhesin encoded by a naturally occurring 
gene from which the gene coding for the adhesin variant is 
derived, has altered binding properties. Furthermore, the 
FimH adhesin may be either mannose- sensitive or mannose- 
10 insensitive. The term "mannose- sensitive" is used herein to 
designate that the binding of an adhesin is inhibited in the 
presence of mannose residues. In one specific embodiment, the 
FimH adhesin may be a FimH adhesin normally binding to a 
receptor moiety selected from a domain where mannosyl resi- 
dues are not terminal and a domain devoid of saccharide such 
as e.g. a glycolipid, a glycoprotein, a protein, a 
polypeptide and a peptide, including a hormone. Examples of 
proteins to which a FimH adhesin according to the present 
invention may bind include as examples animal proteins such 
as a casein including -casein, a gelatine, a globin, an 
albumen and a collagen, and vegetable proteins including soy 
protein. 
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The FimH adhesin according to the invention include an adhes- 
in having an amino acid sequence which differs from the E 
coli PC31 FimH adhesin by at least 2 amino acid residues, * 
such as an amino acid sequence differing by at least 3 aL±no 
acids, preferably by at least 4 amino acids, more preferably 
by at least 5 amino acids, most preferably by at least 6 
amino acids. In further useful embodiments, the amino acid 
sequence may even differ by more than 6 amino acids such as 
at least 7, preferably by at least 8, more preferably by at 
least 9, even more preferably by at least 10 and in particu- 
lar by at least ii amino acid residues, such as by at least 
12 amino acids including by at least 15. 
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Accordingly, the above FimH adhesin is preferably at least 
90% homologous to the PC31 FimH adhesin, such as at least 92% 
homologous, more preferably at least 93% homologous, even 
more preferably at least 94% homologous, most preferably at 
5 least 95% homologous, and in particular at least 96% 

homologous, e.g. at least 97% homologous. In particularly 
interesting embodiments, the adhesin is at least 98% 
homologous, including at least 99% homologous or at least 
99.5% homologous . 

10 In one specific embodiment, the FimH adhesin as defined above 
is one which, when tested for binding to yeast mannan (Mn) , 
human plasma f ibronectin (Pn) , periodate treated Fn and the 
synthetic peptide FnSpl comprising the first 30 amino acids 
of Fn, only binds to Mn. In the following, an adhesin having 
this pattern of binding properties is designated an 
M class FimH adhesin. In other specific embodiments, the FimH 
adhesin is an adhesin which, when tested for binding to the 
above compounds, binds to Mn and Fn (MF class FimH adhesin) 
or an adhesin which among these compounds bind to all of 
20 these (MFP class FimH adhesin) . 

It has been found that bacteria expressing FimH adhesins of 
the above MFP class bind in a mannose- sensitive (MS) manner 
to polyoxyethylene sorbitan monolaurate (Tween 20) and a 
little less well to polyoxyethylene sorbitan monooleate 
(Tween 80). Furthermore, bacteria expressing MFP class FimH 
adhesxns make a much tougher pellicle than bacteria expres- 
sing other types of adhesins. In the present context, the 
term "pellicle" indicates a layer or film of bacteria that 
forms at the air/liquid interface of a liquid growth medium. 
This noticeable phenomenon might be of particular interest 
where there is a reason to concentrate microorganisms at the 
surface of an aquatic environment, such as e.g. bacterial 
cells Which in accordance with the present invention express 
a pollutant -degrading gene product. 
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Another interesting finding is that bacteria expressing a MFP 
class adhesins bind to hydroxyapatite to a higher degree than 
do bacteria expressing a M class adhesin. Hydroxyapatite, 
especially saliva- treated hydroxyapatite is i.a. used as a 
5 model for tooth enamel, and accordingly, this finding indi- 
cates that bacteria esqjressing MFP class adhesins are par- 
ticularly useful in bacterial coit^jositions intended for 
colonization of teeth. 



It has also been found that the MFP class adhesins bind to a 
10 large range of synthetic peptides and accordingly seem to 
have a broad specificity in terms of amino acid motifs. 

In further specific embodiments of the invention, the FimH 
adhesin is an adhesin which, when tested for binding to the 
five Fn- fragments obtained by thermolysin treatment as it is 
15 described in reference No. 51, only binds to the 40-kDa 

gelatin- binding fragment or which binds to all of these Fn- 
fragments, or to none of these. 



20 



In addition to the above classes of FimH adhesins, another 
class has been identified which is designated the m'^ (low 
adhesive) class. Such an adhesin confers the ability to 
aggregate yeast cells in a mannose- sensitive (MS) fashion, in 
titers similar to M class adhesins, but surprisingly, it 
binds at only low levels to Mn or Fn and FnSpl. Furthermore, 
adhesins of this low adhesive M^ class adhere poorly to MDCK, 
25 buccal cells and erythrocytes as compared with M class adhes- 
ins. Example of a M^ class adhesin is one expressed by the 
recombinant E. coli strain KB 23 which differs only from the 
PC31 FimH adhesin by having an alanine instead of a valine at 
residue 27 and the FimH adhesin expressed by the human fecal 
30 E. coli isolate which is designated F-18 [McCormick et al., 
1989 (ref. 34)]. This latter adhesin differs from the PC31 
FimH in three amino acid residues and the F-is isolate has 
been found to colonize the large intestine to a higher degree 
than certain E. coli k-12 strains do. Accordingly, it is 
35 contemplated that these M^ class adhesins confer to 
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gastrointestinal bacteria the ability to colonize the large 
intestine which is significant for a live bacterial vaccine 
for exerting its immunological effect in the gastrointestinal 
tract . 

Furthermore, it has been found that among M class adhesins 
adhesion is found that is not sensitive to inhibition by D- 
mannose. Such a mannose- insensitive (or mannose- resistant) M 
class adhesin is designated in the following as an adhes- 
xn. one example of a bacterial strain expressing an M« adhes- 
xn xs the Clinical isolate U221-3 which is mentioned in the 
following. 

in accordance with the invention, a FimH adhesin as defined 
above can be a chimeric adhesin con^rising amino acid 
sequences from different PimH adhesins. Such chimeras are 
constructed e.g. by providing multiple restriction fragments 
Of a f.mH gene, followed by exchanging under ligation condi- 
tions these fragments with corresponding fragments of an 
other fimH gene and cloning the ligation product as it is 

20 ToZL'^^^ r^'' ' *^ " explained below, 

recombinant plasmids containing such chimeric flnm genes can 
be transformed into a host cell and transformants tested for 
adhesrve phenotype, allowing determination of the regions of 
each gene capable of conferring functional activity (Fig 5, 

25 t^t" T^'r ^^'"'"^ ^"^^^^ below showld • 

that all of the sequence changes relative to the PC31 fl^ 

gene that affected binding function In the studied strains of 

r T to I'll --i^nated ,s 3, 

4, 7 10, F.18 and U221-3, respectively, occurred between 
residues 27 and 119, both included, of the 279 residue 
30 mature flmH sequences. sioue, 

TiZ'^tT'"'' encon^sses in one embodiment a 

tr^ T'T -=i<i -^ence which differs 

from the E. col. PC31 FimH adhesin by at least one amino acid 
occurring between residues 27 and 119 of the mature Pim^ 
5 sequence, including a PimH adhesin comprising an amino acid 



wo 95/20657 PCT/DK95/00042 

19 

sequence which differs from the E. coli PC31 FimH adhesin by 
at least one amino acid occurring between residues 33 and 78 
of the mature FimH sequence. 

The selected potential receptors for a FimH adhesin as 
defined above include those animate and inanimate receptors 
mentioned above for a recombinant bacterial adhesin variant 
and the potential uses of the FimH adhesins are also the same 
as those uses described above for this recombinant bacterial 
adhesin variant. 
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As mentioned above, the invention relates in a further aspect 
to a recombinant replicon comprising a DNA sequence coding 
for a recombinant bacterial adhesin variant as defined herein 
ox a DNA sequence coding for a FimH adhesin as also defined 
herein. Such a replicon is suitably selected from a chromo- 
some or a plasmid. The DNA sequence includes a sequence which 
is inserted by conventional recombination techniques such as 
insertion by means of restriction enzymes and subsequent 
ligation, or the DNA sequence is provided by siobjecting a 
replicon comprising a naturally occurring sequence coding for 
an adhesin to a mutagenization procedure including site- 
directed mutagenesis, insertion of a transposable element, 
mutagenization by radiation or chemical mutagenization, 
followed by selection of cells comprising a mutated sequence 
conferring altered binding properties relative to a cell 
25 comprising the wild- type sequence. 

In preferred embodiments, the recombinant replicon is one 
having a broad host range including bacterial species na- 
turally occurring in soil, in aquatic environments, on inner 
and outer surfaces of humans and animals, and which is com- 
patible with replicons occurring in potential host strains. 

In one useful embodiment, the recombinant replicon as defined 
above is one wherein the DNA sequence codes for a FimH adhes- 
in having an amino acid sequence which differs from the E. 
coll PC31 FimH adhesin by at least one amino acid, including 
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an adhesin having an amino acid sequence which differs from 
the E. coli PC31 FimH adhesin by at least 2 amino acid resi- 
dues, such as an amino acid sequence differing by at least 3 
amino acids, preferably by at least 4 amino acids, more 
preferably by at least 5 amino acids, most preferably by at 
least 6 amino acids. In further useful embodiments, the amino 
acid sequence may even differ by more than 6 amino acids such 
at least 7, preferably by at least 8, more preferably by at 
least 9, even more preferably by at least 10 and in particu- 
lar by at least ii amino acid residues, such as by at least 
12 amino acids including by at least 15. 

Accordingly, the above recombinant replicon preferably com- 
prises a DNA sequence coding for a FimH adhesin which is at 
least 90% homologous to the PC31 fimH gene, such as at least 
92% homologous, more preferably at least 93% homologous, even 
more preferably at least 94% homologous, most preferably at 
least 95% homologous, and in particular at least 96% 
homologous, e.g. at least 97% homologous. In particularly 
interesting embodiments, the adhesin is at least 98% 
homologous, including at least 99% homologous such as at 
least 99.5% homologous. 

In a further embodiment, the above replicon comprises a DNA 
sequence which is a chimeric fimH gene as it has been defined 
above, comprising DNA from different fimH genes. The replicon 
can also be one which comprises a further DNA sequence e.g. 
derived from a microorganism selected from a bacterium, a 
virus, a protozoan, a fungus and a yeast. This furtherDNA 
sequence is e.g. one coding for a heterologous polypeptide 
including an epitope, an antibody, a toxic gene product an 
enzyme, a pesticidally active gene product and a pollutant - 
degrading gene product. 

In useful embodiments, the replicon as defined herein com- 
prises a DNA sequence which is isolated from an Enterobacte- 
rxaceae species, including a DNA sequence which is isolated 
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from E, coli, a Klehslells sp., an Enterobcicter sp., a 
Yersinia sp. or a Salmoaella sp. 

In addition to being a DNA sequence as defined above, the 
sequence can be a synthetic sequence constructed by conven- 
5 tional techniques of DNA synthesis. 

As it is also mentioned above, the present invention 
encompasses a fusion protein comprising a recombinant bacte- 
rial adhesin variant or a FimH adhesin as defined above, and 
a heterologous polypeptide. Such a polypeptide is in useful 
embodiments an immunologically active gene product i.e. an 
epitope (antigenic determinant) from a pathogenic organism, 
which polypeptide, when administered to the body of a human 
or an animal is capable of stimulating the formation of 
antibodies therein, a cell in which such an epitope is 
expressed is advantageously utilized in the preparation of 
live vaccines. Such vaccines have several advantages over 
known live vaccines: 
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Firstly, the epitope forms a structural part of an adhesin 
which is embedded in a surface organelle of the vaccine 
cells. This implies that the hybrid DNA sequence coding for 
the epitope further comprises the means for transporting the 
epitope, when expressed, to the outer surface of the cell, 
i.e. translocating it through the cell membrane. This is 
immunologically highly advantageous, since the epitope will 
be brought more closely in contact with immunologically 
competent cells of the body to which the fusion protein- 
expressing vaccine cells are administered. 

Secondly, the adhesin part of the epitope- carrying fusion 
protein can be selected so as to have specific binding prop- 
erties Whereby the vaccine cell may be targeted to a particu- 
lar location in the body where an immunological response to 
the epitope is desirable. The adhesion of the epitope- carry- 
ing cell to a particular location or region of the body will 
xn this manner ensure that the cell is retained in the human 
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or animal body in that particular location for a period of 
time which is sufficient to obtain the desired immune 
response. 

In accordance with the invention, a useful cell for expres- 
sion of the above fusion protein is one selected from a 
bacterial species which inherently contains an adhes in -carry- 
ing surface organelle. Such species include as examples gram- 
negative species of Enterobacteiriaceae such as E. coli. 
Klebsiella spp, Salmonella spp, Yersinia spp, Vihriona'ceae , 
Hemophilus spp, Bordetella spp and Pseudomonadaceae , and 

gram-positive species such as Neisseria spp and Streptococcus 
spp. 

The epitope part of a fusion protein according to the inven- 
tion can be an epitope derived from any pathogenic organism 
or agent against which it is desirable to develop vaccines. 
Such pathogenic organisms include viruses, bacteria and 
eucaryotic organisms such as fungi, yeast or protozoa. 

Whereas cells expressing an epitope -carrying fusion protein 
as defined herein may be used as a live vaccine, it is also 
wxthxn the scope of the invention to provide isolated and/or 
purxfxed cell surface organelles comprising the fusion pro- 
tein, including fimbriae and pili, as a vaccine, and it is 
also contemplated that useful vaccines may be provided where- 
xn cells expressing an epitope- carrying fusion protein have 
been killed by conventional methods such as formaldehyde 
treatment or thermal treatment. 

in a further embodiment of the invention, the fusion protein 
according to the invention comprises as the non-adhesin 
polypeptide part a toxic gene product e.g. having a selective 
3 0 toxic effect on particular cells in the body such as e g 

cancer cells. By selecting the adhesin part as one having a 
specific binding affinity to receptors in such cells it is 
possible to have cells expressing the toxic gene product 
bound selectively to such target cells whereby these cells 
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may be killed or damaged by the toxic gene product. It is 
also possible to use isolated or purified cell organelles 
containing a fusion protein comprising the cell toxic 
(cytotoxic) gene product for the purpose of targeting the 
toxic product. 

In a further interesting embodiment, the fusion protein 
comprises an antibody. Such an embodiment is, inter alia, 
particularly interesting with respect to the provision of 
fusion proteins which may be used in affinity purification of 
biological compounds having binding affinity to the antibody 
part of the fusion protein, it is contemplated that cells 
expressing as part of a surface organelle, such a fusion 
protein may be utilized directly as a means of concentrating 
a biological compound, or the isolated surface organelles 
comprising the antibody- carrying fusion protein may be used 
for this purpose. 

Furthermore, the fusion proteins as defined herein are useful 
as carriers of non- covalently bound compounds such as pharma- 
cologically active, diagnostically active and imaging com- 
pounds with the purpose of providing cells or cell organelles 
carrying the active compounds, which thereby become targetab- 
le to particular regions or locations of a body to which 
these cells or cell organelles are administered. The inven- 
tion encompasses any combination of a fusion protein as 
defined herein and an active compound which can be covalently 
bound to a fusion protein. 

AS mentioned above, the present invention encompasses in one 
aspect a recombinant bacterial cell which expresses a recom- 
binant bacterial adhesin variant or a FimH adhesin as defined 
above. In one specific embodiment, the bacterial cell is one 
which comprises the above-defined recombinant replicon 
Depending on the field of application of such a cell, it may 
e.g. be selected from a soil bacterium,, an aquatic bacterium, 
a bacterium which is normally associated with plants a 
bacterium which is member of the human or animal indigenous 



wo 95/20657 IV ^jjj^ 



10 



15 



20 



PCT/DK95/00042 
24 



bacterial flora, or a bacterium which is adapted to colonize 
certaxn ecological niches such as e.g. sewage purification 
plants or certain inanimate surfaces. 

The major significant advantages which have been achieved by 
the present invention is the possibility to provide recombi- 
nant bacterial cells which are not only ecologically well- 
adapted to grow in a particular ecological environment, but 
whxch are also provided with means for colonizing more perma- 
nently xn their ecologically natural environment. These means 
for xmproved ability to colonize an environment are the 
adhesins expressed by the bacteria which have been con- 
structed and/or selected so as to enable the recombinant 
bacterxal cell to adhere to or bind to specific receptors in 
the envxronment, i.e. the bacterial cells are targeted to 
that envxronment. Thereby the bacteria according to the 
present invention will have an ecologically competitive 

IZ2^7^ TT'^^ '° particular environment 

Whxch do not have surface structures comprising adhesins 
bxndxng to receptors present in the environment, at least not 
to the same extent as the bacterial cells according to the 
xnventxon. ^ 
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^ct!!Lr -viron^ent-speclfic adhesins which .he 

TtlT / s, the cell Will have a phenotype 

"h.ch .s desirable in the envircnn,ent to which it is tar- 
geted. As one exa:^le, a cell according to the invention 
louf from the hu^nan or .nis^l indige- 
nous bacterial flora typically be one which expresses !n 
epitope including an epitope which is part of a .Z.oZro 
tein expressed by the bacterial cell. As another exar^le ™ay 
be mentioned a bacterial cell which is isolated from a pl"t 
Lcil":: .:^^^^"^ ^ Pe-icidany active confound such'^ra 

Plant root-assocated nitrogen- fixating bacterium isolated 
from soil which in accordance with the invention is provided 
wxth adhesins improving the capability of the bacterL to 
become permanently colonized to the roots of a specific plant 
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or specific plants, or a bacteriiom which is ecologically 
associated with an aquatic or terrestrial environment con- 
taining pollutants to be degraded or removed. 

Accordingly, the recombinant bacterial cell can be derived 
5 from any gram-negative or gram-positive bacterium for which a 
need exists to obtain improved colonization in a particular 
inanimate or animate environment. Such bacteria include as 
exaii^les Bnterobacte:riaceae spp, Hemophilus spp. Neisseria 
spp, Bordetell^ spp. Streptococcus spp, Pseudomonadaceae spp, 
10 Vibrionaceae spp, Baccilaceae spp. 

in certain embodiments of the invention it is advantageous 
that the present recombinant bacterial cell is provided as 
one Which, when it is administered to a particular location 
or environment, will not persist in that environment. Accord- 
15 mgly, such a recombinant bacterial cell may further comprise 
a gene coding for a gene product which, when expressed has a 
killing or cell function- limiting effect in said cell, the 
expression of said gene coding for the cell killing or cell 
function-limiting gene product being regulated in such a 
manner that the bacterial cell when targeted to receptor in a 
specific location will be killed or limited in its function 
m a pre -determined manner. The gene coding for the cell 
killing or cell function- limiting gene product is suitably 
regulated by a factor selected from the group consisting of a 
stochastic event, the presence/absence of a chemical compound 
in the location, and a physical factor. 

in a further aspect, the invention relates to a method of 
isolating or constructing a recombinant bacterial cell ex- 
pressing an adhesin having modified binding properties rela- 
tive to a natively expressed adhesin such as a natively 
expressed FimH adhesin. As it is defined above, this method 
comprises identifying in the bacterial cell DNA sequence (s) 
coding for the binding domain(s) of said natively expressed 
adhesin and substituting at least one codon herein whereby a 
modified adhesin molecule is expressed that is different in 



20 



25 



30 




25 



30 



35 



WO 95/20657 _ 

PCT/DK95/00042 
26 

Z/tT T ""^ -=P-eB3ed natively, 

and selecting a bacterial cell expressing the modified adhei- 
in having an altered adhesion phenotype relative to the 
natively expressed bacterial adhesin. 

genes Ld s '"^ instructing chi.„eric adhesin- encoding 

genes and screening for cells which by having a region in tL 
adhesin gene replaced by a corresponding hete'rologL re:iL 
of a different DNA se<juence. acquires a new binding 

^ "^"-"^ - natively 

adhesins and eo^ression of recon>binant 

: The s^«!tutIon L "^^^'^^^ phenotypes. 

per TZT " "^^ "-"hods know 

per se such as site-directed mutagenesis using synthetic 
Oligonucleotides and ppp t-^^u oy«tnecic 
or by conventi!lr ! '^''""losy or transposable elements 
by conventional radiation or chemical mutagenization. 

sLr"^^° embodiments, the above method includes 

:ZZ: 17 : =™ is associated Ih the 

adhesL is par: ITl Zllla ''"^ recombinant 

As mentioned above tho s^^r^^*-' 

nant bacterial celis tv nTs ei\«:d^::r'^^^^^^ 
Whereby cells with • ^^^'^''^'^ bxndmg properties 

.ents w.ere"heTestc:^".Lt:r::j- ^""""^ 
Phenotype is advantageous. Ac^girtL^: 

aspect Of the invention provided a ^Ll of p::^::,:/:"''" 
recombinant bacterial cell that binds to a sp^ciTc "e^ ptor 
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moiety, comprising introducing into a bacterium that does not 
produce an adhesin binding to said receptor moiety, a DNA 
sequence coding for an adhesin binding to the receptor moie- 
ty, and selecting a bacterial cell expressing the DNA 
5 sequence . 
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The primary objective of this method is to provide the means 
of constructing a bacterial strain having the capacity to 
colonize an environment, based on a parent strain which has 
an inherent, useful phenotype in this particular environment 
but which does not express an adhesin binding to receptor 
moieties in the environment. Accordingly, the method includes 
as a first step the isolation of an environmentally adapted 
bacterium not binding to appropriate receptor moieties and in 
subsequent steps, the identification of heterologous genes 
encoding adhesins which bind to receptor moieties occurring 
in said environment, preferably moieties occurring abundant- 
ly, isolating this gene and introducing it into the above 
parent strain. The adhesin gene may e.g. be a gene coding for 
a naturally occurring FimH adhesin or a recombinant FimH 
20 adhesin as defined above. 

in one useful embodiment of the method, the adhesin- encoding 
gene is introduced by transforming a parent bacterial cell 
with a recombinant replicon as defined herein, in further 
embodiments, the method is designed so as to obtain a cell 
wherein a non-adhesin compound is associated with the adhes- 
in, e.g. by introducing the gene coding for an adhesin as a 
hybrid gene coding for a non-adhesin polypeptide whereby non- 
adhesin compound is expressed with the adhesin as part of a 
fusion protein comprising the adhesin, or by binding non- 
covalently a compound to the adhesin when expressed. 

Besides the above method, an adhesin carrying bacterial cell 
having an altered pattern of adhesion can be provided by 
using a selection procedure comprising contacting an appro- 
priately sized population of wild- type adhesin- carrying 
bacterial cells with a potential receptor moiety to which the 
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wild- type cells do not adhere e cr r, = 

^ etcuiere, e.g. in a manner as it is 

Z IZ J: TV """^^^ spontaneous!, c. .an.o.. 

receptor n,o«ty xn question, become progressively enriched 
ability can readily be isolated and further characterized 

^vi'ortheT ^^'^''^ P-^^ ""jec- 

ta'tina r " = of 

cargeting a compound to a snecifSr. 

t'Tcul'ari'cc":"'""' °' ' ^"''^'^ ^ Par- 

uicuxar location can be ca-r-r-ifi^H ^, 4- • 

example is a screening procedure where cellT 

adhesins or .cno.^ leolated adhesils "e --Pressing taown 

location e.g beina . ^^^"-"^ ^^e administered to the 

herein T*^ w J. ^j-'-'" assays as disclosed 

.ndL^:: :r:e%::toT.rr:: rti- :rtT- - - - 

adhesin<s,. is/are present in the locati^I. 

location may be^^^:: zitz.t^''' °' ^^^^ 

and based upon such dat= 1 " 9s"erated, 

one or more of the id^^if^T"" " ^^"^ " 

he Identified major moieties are selected and 
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their binding to this/these structure (s) is tested e.g. 
according to the assays as used herein. Chemical moieties 
which are considered potential adhesin- interacting receptor 
moieties include as examples glycolipids, glycoproteins, 
i proteins, polypeptides, saccharide moieties and peptides. 

In the case no suitable chemical moiety is identified in the 
location, which is capable of binding to known adhesins or 
which bind with a sufficient affinity, it is required to 
construct a library of modified adhesin molecules based on 
known adhesins which are modified by replacing one or more 
codons as it is explained herein, and/or such a library 
provided by constructing synthetic adhesin molecules, and 
then screening this library for recognition of and interac- 
tion with identified location surface moieties. A library of 
15 modified FimH adhesins may e.g. be selected for specificity 
towards a given receptor by running clones of these adhesins 
through a column or matrix containing the receptor moiety in 
question or cells or tissues isolated from the location 
without knowing what the receptor moiety is. The clone (s) 
20 expressing the adhesins with affinity to receptor moie- 
ty/moieties will adhere/bind to the column or matrix, and can 
subsequently be isolated therefrom. 

It is within the contemplation of the invention that crystal - 
lographic analyses of adhesins, whether naturally occurring 
or constructed as indicated above, is a useful technique for 
the obtainment of information about adhesin structures that 
assumingly will recognize and interact with particular adhes- 
m receptor moieties . 



25 



30 



35 



In accordance with the invention, one embodiment of the above 
method is one wherein the isolated bacterial cell expresses 
an adhesin having modified receptor moiety-binding properties 
relative to an adhesin natively expressed by the cell, the 
isolation Of the cell comprising identifying in a parent 
bacterial cell, dna sequence (s) coding for the binding 
domain (s) of said natively expressed adhesin and substituting 
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at least one codon herein, whereby a modified adhesin mole- 
cule is expressed that is different in at least one amino 
acid from the adhesin expressed natively, and selecting a 
bacterial cell expressing the modified adhesin having an 
altered adhesion phenotype relative to the natively expressed 
bacterial adhesin or a method wherein the bacterial cell 
expressing an adhesin that recognizes and binds to the 
receptor moiety is a recombinant bacterial cell derived from 
a parent bacterial cell that does not produce an adhesin 
binding to said receptor, by inserting into the parent cell a 
DNA sequence coding for an adhesin binding to the receptor 
moiety, and selecting a bacterial cell expressing the DNA 
sequence . 



15 



One primary objective of the present invention is the target- 
ing of useful non-adhesin conpounds to a particular location. 
Accordingly, the invention encompasses in an interesting 
embodiment a method as defined above wherein a non-adhesin 
compound is associated with the adhesin, whereby said com- 
pound is targeted with the adhesin to the location coir^jrising 
20 the receptor moieties recognizable by the adhesin. 

The compound can be associated with the adhesin by a covalent 
binding or by any of the above mentioned non- covalent types 
of molecule interaction forces. 



25 



When associated covalently with the adhesin the compound to 
be CO- targeted to the selected location with the adhesin can 
be an enzyme, an antibody, an epitope or a toxin which is 
part of a fusion protein comprising the adhesin. A compound 
which is associated with the adhesin by a non- covalent bind- 
ing is typically a pharmacologically active, diagnostically 
3 0 active or imaging compound. 

Locations to which it is desirable to have an adhesin tar- 
geted by the present method include a human or animal sur- 
face, a plant surface and an inanimate surface as defined 
above. 
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In one specific embodiment of the present method the bacteri- 
al cell being administered to the location expresses a recom- 
binant bacterial adhesin variant derived from a naturally 
occurring parent adhesin, said adhesin variant having altered 
binding properties relative to the naturally occurring adhes- 
in from which it is derived, the altered binding properties 
including binding to at least one receptor moiety to which 
the parent adhesin does not bind. Such an adhesin variant is 
advantageously derived from a naturally occurring adhesin 
isolated from a cell structure selected from the group con- 
sisting of a capsule, a lipopolysaccharide layer, on outer 
membrane protein, a flagellum, a pilus, a fimbria, a non- 
fimbrial adhesin (NFA) or an afimbrial adhesin (AFA) . 

In specific embodiments of the invention, the above adhesin 
variant as used in the present method is a protein having an 
amino acid sequence differing in at least one amino acid 
residue from its parent protein adhesin such as a FimH adhes- 
in having an amino acid sequence which differs from the E. 
coll PC31 FimH adhesin as defined herein in at least one 
20 amino acid. Such a FimH adhesin includes an adhesin which 
binds to a receptor selected from the group consisting of a 
domain where mannosyl residues are not terminal and a domain 
devoid of saccharide and an adhesin variant which is at least 
90% homologous to the PC31 FimH adhesin as defined herein, 
such as at least 92% homologous, more preferably at least 93% 
homologous, even more preferably at least 94% homologous 
most preferably at least 95% homologous, and in particular at 
least 96% homologous, e.g. at least 97% homologous. In par- 
ticularly interesting embodiments, the adhesin is at least 
98% homologous, including at least 99% homologous or at least 
99.5% homol ogous . 

The above FimH adhesin can be a chimeric adhesin as defined 
above, comprising amino acid sequences from different FimH 
adhesins and constructed according to the methods below 
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In accordance with the invention, an adhesin can be adminis- 
tered to a location in the form of an adhesin- expressing 
bacterial cell . Such a cell is one capable of growing in that 
particular location. Accordingly, the bacterial cell is 
suitably derived from a bacterial species which is normally 
occurring in the location including human or animal body 
surfaces, plant surfaces such as plant root surfaces and 
inanimate surfaces. In this context, an animal body surface 
includes the insect gut, whereto it is desirable to adminis- 
ter a bacterial cell expressing an insecticidally active 
toxin. 



Thus, if it is desired to administer the bacterial cell to 
the root of a plant, a suitable bacterial cell is preferably 
isolated from a strain which has colonized the rhizosphere of 

15 that plant to a large degree, i.e. the strain is a major 
member of the natural plant root flora. Such an isolate is 
then provided with a gene coding for an adhesin which will 
recognize and interact with an abundantly occurring moiety on 
the roots of said plant. In this manner, a suitable adhesin 

2 0 which is expressed naturally in a bacterium which is not 

adapted to grow in a plant rhizosphere, becomes expressible 
xn a normal inhabitant of the rhizosphere environment (loca- 
tion) • 



In specific embodiments of the present method of targeting a 
bacterial adhesin to a specific location, the adhesin is a 
FxmH adhesin as defined above, having an amino acid sequence 
Which differs from the E. coli PC31 FimH adhesin as defined 
herein in at least one amino acid. 

In an interesting embodiment, the adhesin- carrying bacterial 
cell being targeted is a cell which further comprises a gene 
coding for a gene product which, when it is expressed, has a 
kxllxng or cell function- limiting effect in said cell the 
expression of said gene coding for the cell killing or cell 
function-limiting gene product being regulated in such a 
manner that the bacterial cell, when targeted, will be killed 
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or limited in its function in a pre-detennined manner. The 
expression of such a "suicide" or cell function- limiting gene 
may suitably be regulated by a factor selected from the group 
consisting of a stochastic event, the presence /absence of a 
5 chemical compound in the location and a physical factor. As 
examples of such "suicide" or cell function- limiting genes 
providing the means of biological containment, may be men- 
tioned those disclosed in WO 87/5932 and WO 93/20211 

Furthermore, the present Designer Adhesin Technology (DAT) 
10 provides very useful means of obtaining colonization with 
desirable bacteria in a particular environment with the 
purpose of obtaining beneficial changes of the microbial 
flora in the environment. As one example, certain bacterial 
species in the gastrointestinal (Gl) tract of humans and 
15 animals have beneficial effects on the health condition of 

the host organism e.g. by suppressing pathogenic organisms or 
by contributing to the digesting of certain diet components. 
The present technology makes it possible to select particu- 
larly useful bacteria from the GI- tract and have them 
designed in accordance with the present invention, to have 
improved colonization abilities. Similar examples include 
desirable bacterial colonizations of biological sewage puri- 
fication systems, plants where invasion of pathogenic organ- 
isms may be controlled by colonizing the plants with harmless 
bacteria, and teeth where caries may be controlled by coloni- 
zing the dental enamel with bacteria suppressing those caus- 
ing the caries attacks. 
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In another industrially interesting aspect, the invention 
provides the means of isolating a compound from a solution or 
suspension containing the compound. The method comprising 
contacting the solution or the suspension with a fusion 
protein as defined herein wherein the heterologous 
polypeptide has an affinity to the compound to be isolated. 

Furthermore, the invention provides a composition comprising 
a population of a bacterial cell as defined herein. 
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The invention is further illustrated in the below Examples 
and the Figures, wherein 

Fig. 1 is a schematic model for the construction of recom- 
binant plasmids pGBl-24 (containing fimH from CI #10) and 
5 PGB2-24 (containing fimH from PC31) used for transforming E. 
coli AAECl9iA(pPKL114) with cloned fimH genes. Plasmid pGB2- 
24 was used as the vector for all other cloned fimH genes 
described herein; 



10 



15 



Fig. 2 is a restriction map of fimH genes. Five unique res- 
triction sites are present in the PC31 fimH gene. Numbers in 
parentheses following enzymes are the base pair positions of 
the cut sites. Some of these sites are found in the other 
fimH genes, as marked. Chimeric genes were produced by ex- 
changing each available restriction fragment from the other 
five fimH genes with corresponding fragments in the PC31 gene 
and then recombinant strains expressing resulting chimeric 
fimH subunits were tested for adhesion. Fragments indicated 
by boxes are those which conferred MF or MFP adhesive 
phenotypes on the chimeric genes; 

20 Fig. 3 illustrates adhesion of representative "wild- type" (A) 
and recombinant (B) M- class, MF- class and MFP- class strains 
to Mn (1), Fn (2), periodate- treated Fn (3) and to FnSpl (4). 
Strain designations given for the "wild- type" strains are 
given in AS. Strain designations KB31, KB12, KB4, KB7, KB50 

25 and KBIO, are for recombinant strains of AAECl9lA(pPKL114) , 
which is fimH- , after transformation with plasmids that 
contain fimH*- from strains HBlOl (pPKL4) , CI #12, CI #4, CI 
#7. CSH-50 and CI #10, respectively. Open columns indicate 
results when bacteria were incubated in buffer without D- 

30 mannose, while solid columns are results in the presence of 
D-mannose. Values indicated are the mean ± S.E.M. (n=4) for 
each column; 

Fig. 4 illustrates the adhesion of representative M- class 
MF-class and MFP-class strains (CIs #12, #4 and #10, respect- 
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ively) to Fn fragments prepared by thennolysin treatment as 
described in ref. 51. Columns labelled 1-5 indicate adhesion 
to: 1) NH2-terminal 30-kDa domain; 2) the 55-kDa gelatih- 
binding domain; 3) the llO-]cDa cell attachment domain; 4) the 
29-38-kDa heparin binding domains; and 5) the 20-kDa COOH- 
terminal domain. Open colixmns represent adhesion in the 
absence of D-mannose; solid columns represent adhesion in the 
presence of D-mannose. Mean + S.E.M. (n=4); 

Fig. 5 is a conposite figure illustrating coit?3arison of amino 
acid sequences of FimH adhesins and active restriction frag- 
ments of fimH genes. The published nucleotide and deduced 
amino acid sequence of the PC31 flmH gene and gene product 
(ref. 27) serve as prototype. Numbered amino acid residues 
shown above the model of the PC31 FimH represent residues 
that are different in other FimH subunits due to amino acid 
substitution or deletion. Standard one- letter code applies 
and residues in the other FimH sequences that are different 
are indicated. Deleted amino acids are indicated by A. It 
should be noted that residue 176 is not arginine as published 
previously (ref. 27) for the PC31 FimH, but proline. Regions 
of the FimH subunits conferring change in adhesive phenotype, 
highlighted in bold, were determined by functional assays 
performed on chimeras between the "classic" mannose -specific 
PC31 fimH gene present in HBlOl (pPKli4) and the above 
described genes. Residues predicted to be key in conferring 
receptor specificity are circled. Approximate positions of 
unique restriction sites used to create chimeras are indi- 
cated along the bottom of the model; 

Fig. 6 illustrates plasmid pPKL4 which is a derivative of 
PBR322 (thick line) carrying the entire fxm operon (FimA-H) 
including the regulatory genes flmB and fimE (not shown) , and 
the promoter region with the SnaBl site. In this plasmid an 
8mer linker with an Sgrlii site was inserted in the SnaBX site 
to create pPKL83; 
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Fig. 7 illustrates the construction of plasmid pSM1314; the 
vector PVLT33 is a derivative of the broad host range replic-* 
on RSFIOIO. Plasmid pPKL83 was digested with B^lix and pVLT 
was digested with BairtHl; the two were ligated and pSMl3i4 was 
5 the resulting plasmid in which expression of the fimA-H 
cluster is under the control of the tac promoter; 

Fig. 8 illustrates plasmid pLPA22 and derivatives hereof as 
used in this study. The triangles indicate the position of 
translational stop- linkers in the fimH gene in plasmid 
10 pPKLiis. The positions of heterologous inserts are indicated 
(black boxes) . Small triangles indicate signal -peptide encod- 
ing sectors . 

Fig. 9 illustrates plasmids pLPA29, pLPA30, pLPA36, pLPA58, 
pLPA59 and pLPA98; 

15 Fig. 10 shows immuno- electron microscopy with colloid gold 
labelling of E. coll HBlOl cells containing plasmids pLPA22 
plus PPKL115 (a), PLPA37 plus pPKLllS (b) , pLPA38 plus 

PPKL115 (c). using anti-pre-S2 monoclonal antiserum. Bar, 0.1 

um. 



im. 

20 EXAMPLE 1 
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Functipnal heterogPnei f. y of rype ^ fimh^i;,i .Hheair,.. ^no .r. 
minor secmence variations among fT-ntfj qen^s 



Irl. Materials and ^^«=>^h>of^a 
1.1.2. Reagent a 



30 



Yeast Mn, a polymannosylated glycoprotein isolated from 
Sacch^romyces cere^r±siae cell walls, was obtained from a 
commercial source (Sigma Chemical Co, St. Louis, MO, U S A ) 
Mannan is composed of an N-linked backbone of /31,2-linked 
mannopyranose units with cv- linked mannopyranose side chains 
(ref. 38). The majority of the carbohydrate of human plasma 
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Fn is corrposed of N-glycosidic complex- type biantennary 
glycans and no high mannose- type or hybrid- type N-glycans 
have been described (refs. 30, 45, 54). Human plasma Fn and 
Fn fragments were purified as described previously (refs. 5, 
5 15, 51, 58). Periodate treatment was performed as described 
previously (ref. 51). The synthetic peptide, FnSpl, copying 
the first 30 amino acid residues of the Fn molecule (EAQQMVQ- 
PQSPVAVSQSKPGCYDNGKHYQI) was synthesized in the Protein 
Chemistry Laboratory of the VA Medical Center, Memphis, TN 
10 (SEQ ID NO: 2) . The saccharide content of the four substrata 
was characterized using two lectins, concanavalin A (ConA) , 
well known to react with terminal and internal mannosyl 
residues, and the Calanthus nlvalia agglutinin (GNA) , which 
recognizes only terminal Manal-3Man, Manorl-SMan and Manal- 
2Man sequences (E. y. Laboratories, San Mateo, CA) . immobi- 
lized Mn and Fn both reacted with ConA, whereas GNA bound 
only to Mn. These results are consistent with the known 
structures of the oligosaccharide moieties of these two 
compounds. Neither lectin reacted with immobilized FnSpi. 
Periodate treatment (ref. 51) of Mn or Fn eliminated lectin 
reactivity. 

1.1.3. Bacterial s trains and p Taam-ir»«= 

The CSH-50 strain (lambda" , r'araA riac-pro; rspL thd 
fimE::isi) is a Cold Spring Harbor K12-derived strain (ref. 
25 35). The E. coli strain MG 1655 (CGSC6300; K12 derivative, 
lambda-, F-) and a derivative strain AAEC191A (MG1655 recA ' 
Afim were generously provided by Dr. Ian Blomfield (Bowman 
Gray University, Winston -Salem, NC) . AAEC191A has had the 
entire fim gene cluster deleted by allelic exchange (ref. 8). 

30 Clinical isolates (CIs) were urinary tract isolates obtained 
from the clinical microbiology laboratories of the Memphis VA 
Medical center or The City of Memphis Hospitals, Memphis, TN. 
The 12 CIS used in this study were selected on the basis of 
MS agglutination of yeast cells after growth in broth, a 

35 classic test for type 1 fimbriae. 
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Plasmid pPKL4, a pBR322 derivative containing the entire fim 
gene cluster from E. coll strain PC31 (K12- derivative, ga.1 
tonA phx ara) and encoding for the e^ipression of fully func- 
tional type 1 fimbriae in HBlOl (supE hsdS recA a.ra proA lacY 
ga.lK rspL xyl mtl AfimBE) , has been described previously 
(ref . 28) . pPKLll4 is a recombinant plasmid derived from 
pPKL4, but with a translational stop-linker inserted into the 
JCpnl site in the fimH gene. No transcriptional effects of the 

stop-linker are to be e3cpected. Antibiotics were used at the 
following final concentrations: anpicillin (50 /xg/ml) , 
kanamycin (60 /xg/ml) and chloramphenicol (30 /xg/ml) . 

1.1.4. PQlymP>rase cha in react^inn 

Oligonucleotide primers were designed using the published 
sequence for the fimH gene in pPKL4 (ref. 27). The 5' primers 
copied regions 13 and 49 bp upstream from the fimH gene and 
were extended on the 5' end by an Apall restriction site and 
a GC clamp: Primer 1: 5 ' -GGGGG-GTGCAC-ACC TAG AGC TGA ACC 
CGG-3' (SEQ ID NO: 3); Primer 2: 5'-GGGG GTGCAC T CAG GGA ACC 
ATT CAG GCA-3' (SEQ ID NO:4). The 3' primers copied 18 bases 
of the bottom strand of the fimH gene that encode for the 6 
terminal amino acids of fimH and were extended by an Fspl or 
Spill site and a GC clamp: Primer 3: 5'-GGG TGCGCA TTA TTG 
ATA AAC AAA AGT CAC - 3' (SEQ ID NO: 5); Primer 4: 5'-GGG 
GCATGC TTA TTG ATA AAC AAA AGT CAC- 3' (SEQ ID NO: 6) . Primer 1 
and 3 were used for CI #10 and pPKL4, primer i and 4 were 
used for CI #4 and CSH-50 and primer 2 and 4 were used for CI 
#s 7 and 12 to generate PCR products from plasmid or 
chromosomal DNA prepared from E, coli expressing different 
functional classes of type 1 fimbriae. The PCR reaction 
mixture consisted of template DNA, primer pairs, dNTPs, and 
Tag DNA polymerase in PCR buffer. The PCR was performed in a 
Perkin- Elmer Cetus automatic thermal cycler with denaturation 
at 96°C for 1 min., primer annealing at 55«C for 1 min., and 
primer extension at 72»C for 2 mins. for a total of 40 
cycles. All of the PCR products migrated similarly in agarose 
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gels. Purification, restriction and ligation of DNA was 
performed using standard procedures (refs. 39, 48). All 
primers for PGR and for nucleotide sequencing were produced 
by the Molecular Resources Center, UT, Menphis, 

5 1.1.5. Subeloning 

The PGR products from GI#10 and from pPKL4 were cut with 
respective restriction enzymes and ligated into the Apall and 
Fspl restriction sites of plasmid pACYG177 (New England 
Biolabs, Beverly, MA, U.S.A.) which is compatible with the 
10 PBR322 -based pPKL114 to be used in complementation experi- 
ments, creating plasmids pGBl and pGB2, respectively (Fig. 
1) . However, it became inconvenient to use pACYGl 77 -based 
plasmids because of a high frequency of appearance of sponta- 
neous Km^ in the AAEC191A host strain. The origin of this 
problem is not entirely clear, but it was avoided by subclon- 
ing the fimH genes from pGBl and pGB2. The inserts and 
upstream regions of pACyci77 containing the tet promoter were 
cut from pGBl and pGB2 with Fspl and BamHl and subcloned into 
the polylinker site of pGEM-3Z (Promega, Madison, wi) that 
had been cut with BamHl and Hinc2 , creating plasmids pGBll 
and PGB2-1 respectively. pGBM-3Z was simply used as a con- 
venient intermediate in subcloning into pACYC184. 

The inserts were cut out again using Smal and Hind3 and 
subcloned into pACYC184 (New England Biolabs, Beverly, MA) 
cut with Hinc2 and Hind3, creating plasmids pGBl-2 and pGB2- 
24 containing the fimH genes from CI#10 and pPKL4 , respect- 
ively. These plasmids complement the non- adhesive defect of 
AAEC191A(pPKlall4) giving the adhesive phenotypes of the 
parental strains (see Results) . Cutting the fimH gene from 
PGB2-24 using Apall and Sphl makes it possible to easily 
insert other fimH genes obtained by amplifying chromosomal 
DNA of other isolates by PGR. All recombinant strains we have 
tested thus far using this technique exhibit the same adhes- 
ive phenotype as the parent strains from which the fimH genes 
35 were cloned. 
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X.lr^r Construction of chimg rie fimw go r^^o 

Unique restriction sites (Pig. 2) were used to construct 
chimeric fimH genes between the prototypical MS pPKL4 flmH 
gene, used as genetic background, and restriction fragments 
obtained from the newly described flmH genes. Fragments were 
purified from agarose gels and ligated into restriction 
"spaces" generated in the pPKL4 fimH gene present in pACYCl-84 
(PGB2-24) . Each chimera was analyzed by restriction mapping 
and the nucleotide sequences of bridging segments were deter- 
mined to ensure proper constructions. The plasmids containing 
chimeric fimff genes were transformed into AAECl9iA(pPKLll4) 
and clones were tested for agglutination of yeast cells and 
for adhesion to Mn, Pn and FnSpl. 

1.1.7. Nlir-len^irlo geqii^^f' i rfr^ 

15 The nucleotide sequences of fln^ genes were determined by the 
dideoxynucleotide chain termination method of Sanger (ref 
49) using a Sequenase li® kit (U.S. Biochemical Corp. Cleve- 
land. Ohio) and [or-35s3dATP (800 to 1.000 Ci/mmol) according 
20 w! '""h f ""'""^^^^^'^ suggestions. The amino acid sequences 
2 0 were deduced from nucleotide sequences using MacVector* 

protein and DNA analysis software (Eastman Kodak, Rochester, 
NY) . To ensure fidelity of the PCR amplification, selected 
fimff genes were re-amplif ied, cloned, tested for activity and 
re-sequenced. More recently, we have used the £moI« Polymer- 
25 ase Sequencing System (Promega, Madison, WI) , because it is 

useful with small amounts of DNA and thus subcloning the flmH 
genes from the pACYCl 84 -based plasmids to high copy number 
Plasmids was obviated. Bands were visualized by autoradio- 
graphy of sequencing gels and compared with the published 
JU fimH gene sequence (ref. 27) . 



l.l.fi 



Yeast cell aggT-or Tabinn aae=iy 



E. coll were tested for their ab-i i ■! t-^^ =^ 

'^"eir atjiiity to aggregate yeast 

cells. Commercial baker's veas^ 

s yeast, Saccharomyyces cerevlslae. 
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was suspended in PBS (5 mg dry weight /ml) . B. coli were 
washed in PBS, resuspended to an OD530 of 0.4, and mixed with 
the yeast cell suspension in PBS with or without 1% D- 
mannose. Aggregation was monitored visually and the titer 
5 recorded as the last dilution giving a positive aggregation 
reaction. 

1.1.9. Adhegion assays 



Adhesion assays were performed as described previously (ref . 
51) . Briefly, microtiter assay wells were coated with 

10 receptor molecules as indicated in the text and figure leg- 
ends. After the wells were washed two times with PBS, 100 nl 
bacterial suspensions were added in 0.1% BSA-PBS. After 
incubation at 37<»C for indicated times, wells were washed 
three times with PBS and adherent bacteria were detected by 

15 using rabbit anti-B. coli serum. Antibody binding was 

detected using peroxidase -conjugated goat anti -Rabbit IgG. 
Reaction product generated from the 5 -aminosalicylic acid 
substrate was measured at 405 nm after 10-15 minutes by using 
an automatic microplate reader (Molecular Devices, Inc., 

20 Menlo Park, CA) . Values reported are corrected for bac)cground 
reaction using BSA coated plates as control. 



1.2. Results 



In a previous publication it was reported that type 1 
fimbriae of B. coli CSH-50 and HBlOl(pPKL4) differ func- 
25 tionally in their pattern of adhesion to Mn, Fn, periodate- 
treated Fn and a synthetic peptide. FnSpl, immobilized on 
plastic (ref. 51). Since CSH-50 and HB101(pPKL4) are labora- 
tory strains, we tested 12 clinical B. coli isolates (CIs) 
obtained from human urine for adhesion to these four substra- 
ta. All of the CIS agglutinated yeast cells in a MS fashion. 
Five of the twelve CIs adhered only to Mn. The adhesive 
activity of HB101(pPKL4) and of CI #12 are shown as examples 
of this class, which we have tentatively designated as M 
class (Fig. 3A) . Three of the 12 CIs adhered to Mn and Fn, 
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but not to periodate- treated Fn or to FnSpi. The adhesive 
activities of CI #s 4 and 7 are shown as examples of this 
class, designated as MF class. Three of the twelve CIs 
adhered to each of the substrata. The adhesive activities of 
CSH-50 and CI #10 are shown as examples of this class, desig- 
nated as MEP class. 
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Adhesion of strains representing these three classes to Fn 
fragments further illustrates the distinct differences 
between the three classes. The M class CI #12 does not adhere 

10 ^to any of the Fn fragments (Fig. 4). The MF class CI #4 

adheres to the 40-kDa gelatin-binding fragment. The MFP class 
CI #10 adheres, with only slight differences, to all 5 frag- 
ments Of Fn tested. Periodate treatment eliminated binding of 
CI #4 to domain 2, but had no effect on the binding of CI #10 

15 to any of the Pn domains (data not shown) . 

Since the fimH subunit has been shown to mediate the mannose- 
sensitive activity of type 1 fimbriae, we focused our initial 
efforts to elucidate the molecular basis for the observed 
functional heterogeneity on the fimH gene. rimH genes were 
amplified from chromosomal (or plasmid, for pPKL4) DNA and 
the genes were cloned into pACYC177 and subcloned into 
PACYC184 under control of the ^-lactamase promoter of 
PACYC177, according to Materials and Methods (Fig. i) 

The adhesive phenotypes conferred by the fiwH genes were 
tested in the following way. E. coll K-12 strain AAEC191A 
(Afim) was first transformed with plasmid pPKLli4, which 
contains an intact fin, gene cluster but with a translational 
stop-linker inserted into the last gene, fimH. This deriva- 
tive produces morphologically normal fimbriae that are non- 
adhesive due to absence of the FimH subunit. Plasmids har- 
bouring cloned fimH genes were transformed into E coli 
AAECl91A(pPKLll4) and the resultant strains were tested for 
their ability to adhere to Mn, Fn, periodate- treated Fn and 
to Fnspl (Fig. 3B). Each of the recombinant strains displayed 
adhesive phenotypes mimicking those of the representative 
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parent strains from which the fxmH genes were obtained. fimH 
genes were cloned from each of the other 8 CIs and similar 
results were obtained with the adhesion of recombinant 
strains mimicking that exhibited by the parental CIs. 

5 The complete nucleotide sequences of each of the six repre- 
sentative fimH genes were determined and the amino acid 
sequences of the fimH proteins were deduced as it is shown in 
Table 1 below which is a representation of amino acid 
sequences of the FimH subunits deduced from nucleotide 

10 sequences of selected fimH genes disclosed in this example 
[CI#3 (SEQ ID NO:33), CI#4 (SEQ ID N0:29), CI#7 (SEQ ID 
N0:30), CI#10 (SEQ ID N0:31) and CI#12 (SEQ ID NO:28)] and 
those of the E. coli K12 strain PC31 (SEQ ID N0:1) and E. 
coli strain CSH-50 (SEQ ID N0:32) • Additionally, the FimH 

15 amino acid sequences of the following clinical isolates of E. 
coli are shown: KB21 (SEQ ID N0:27) , KS54 (SEQ ID NO:35) , 
U221-3 (SEQ ID NO:36), MJ#9-3 (SEQ ID N0:37), MJ#31-3 (SEQ ID 
NO:38), MJ#ll-2 (SEQ ID NO:39), iyU#2-2 (SEQ ID NO:l) and F-18 
(SEQ ID NO:34). Standard one-letter code applies. Deleted 

20 amino acid residues are indicated by As. M, M^, MF, MFP, and 
indicate the adhesin class as defined above. 
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Table 1, Amino acid s equences of the FimH proteins deduced 
f?rom nucleotidR semienr e s of fimH aeneg of clinical isolates 
disclosed in this examnle and of E. coll K12 strains PC31- and 
CSH-50 
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and gene products should also be identical, subunit incorpor- 
ation into the fimbrial superstructure should not vary sig- 
nificantly. These results en^hasize that in these experiments 
it is the FimH subunit that determines receptor specificity. 

5 In comparing the new FimH sequences to the one published 

previously (ref. 27), the only structural alteration that can 
be clearly linked to a functional change, without resorting 
to analysis of chimeric fimH genes, is the non- conservative 
substitution of arginine^® in the MFP class CSH-50 FimH 
10 subunit for leucine^® in the M class PC31 FimH subxmit. Since 
each of the other FimH sequences had more than one change, it 
was necessary to construct chimeric genes to begin to focus 
on functionally relevant changes. 

In the case of the CI #10 FimH, an MFP class adhesive activ- 
15 ity similar to that of CSH-50 is conferred by a different 
region of the gene which encodes for a subunit deleted of 
residues 116-119. it remains to be determined how two dis- 
tinctly different structural changes can bring about appar- 
ently similar changes in receptor specificity. It is pos- 
20 sible, of course, that as additional receptor molecules are 
tested, these two variants will be found to be functionally 
distinct. 



The Apall-rthllll fragment of the CI #7 fimH gene confers MF 
class activity in the CI#7/PC3i fimH chimera. Since the 
asparagine^«-threonine^^ substitution is within the leader 
sequence and thus not represented in the mature protein, the 
histidine33- asparagine33 substitution must be of functional 
importance for the MF class CI #7 FimH. Comparison of the 
active regions of the MF class CI #4 and the M class CI #12 
FimH subunits suggests the importance of the glutamic acid"'^- 
glycine''^ substitution for MF class activity of the CI #4 
FimH. Thus, histidine", arginine^a, glutamic acidS'' and 
deleted glycine^"- isoleucine^" appear to be key residues in 
the functional activity of FimH subunits of CI #7, CSH-50, CI 
#4 and CI #10, respectively, but a more precise demonstration 
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of which residues are involved and how they affect the 
ligand- binding cleft (s) remains to be performed. 

At first glance, the FimH mediated, mannose- sensitive pro- 
tein-binding activity of type 1 fimbriae is the most surpris- 
ing of the adhesive phenotypes described here. However, 
protein- binding activity of FimH (i.e. PilE) subunits was 
noted earlier in a study characterizing jnutr- induced muta- 
tions in the fimH (pilE) gene (Harris et al., ref. 22) How- 
ever, the protein-binding activity described by Harris et al. 
was not mannose- sensitive. It is presently not known whether 
the protein-binding activity described herein is in addition 
to or separate from the mannose -binding activity, but the 
concept of bifunctional properties of lectins has been estab- 
lished for several years (ref. 6). While the MFP class type 1 
fimbriae appears to react somewhat promiscuously with most Fn 
fragments, the reaction does not appear to be non-specific. 
For instance, the MFP class CSH-50 type 1 fimbriae do not 
adhere well to gelatin (ref. 51). Further, the adhesion to 
ovalbumin is sensitive to both periodate and glycosidase 
20 treatment (ref. 51). Further work will be required to deter- 
mine the consensus amino acid motif reactive with this class 
of FimH subunit. 
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Previous studies suggested that the combining site of the E. 
coli FimH adhesin is in the form of an extended pocket cor- 
responding to the size of a trisaccharide with an associated 
hydrophobic region (ref. 16). The MS nature of all of the 
adhesive interactions described suggests that if the combin- 
ing sites are separate, they may be close to each other. 
However, it remains to be determined whether or not the 
mannose effect is direct or allosteric. Conformational 
changes that frequently occur in lectins upon binding the 
saccharide ligand (ref. 46) could affect a second, distant 
binding site. Site-directed mutations may be sufficient to 
clarify which structural changes result in changes in 
receptor specificity. However, such studies are unlikely to 
shed much light on how the structural changes actually relate 
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The nucleotide and deduced amino acid sequences of the pPKL4 
fiioH gene are identical to that reported previously, except ' 
that residue 176 is not an alanine residue as previously 
reported, but a proline residue. Independent re-amplifica- 
tion, re-cloning and re-sequencing confirmed this sequence 
for the PPKL4 fimff gene. Sequencing was also repeated on 
independently amplified and cloned isolates of the CI #10 and 
CI #7 f imH genes to confirm sequence fidelity and no errors 
were found. 

The nucleotide and deduced amino acid sequences of the other 
fimH alleles described in this Example are > 98% conserved, 
but there is more than one amino acid residue difference in 
all but one of the new f±mH sequences when compared to the 
published pPKl.4 sequence. To focus on the sequence differ- 
ences that resulted in changes in functional activity, advan- 
tage of unique restriction sites were taken (Fig. 2) to 
construct chimeric fimff genes. Multiple restriction fragments 
covering the entirety of each of the sequenced f±mH genes 
were exchanged with corresponding fragments in the prototypi- 
cal flroH gene of E. coll strain PC31 that was amplified from 
PPKL4, cloned into pACYC184 and used as the genetic back- 
ground. Recombinant plasmids containing the chimeric fimH 
genes were transformed into E. coli AAEC191A (pPKL114 ) and 
transformants were tested for adhesive phenotype, allowing 
determination of the regions of each gene capable of confer- 
ring functional activity (Fig. 5). All of the sequence 
Changes that affected function occurred between residues 33 
and 119 of the 279 residue mature fimH sequence 



1.3. D iscuggi 
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The functional heterogeneity which is described above must be 
due entirely to allelic variants of the fimH gene. The only 
variables in the recombinant strains which are described in 
this Example are the fimH genes; all other genes necessary 
for fimbrial subunit synthesis, transport and assembly are 
the same in each case. Since the ratios of the various genes 
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to the ligand- binding cleft (s) and it will ultimately be 
necessary to determine the 3 -dimensional structure of FimH or 
FimH fragments crystallized in the presence of ligand to 
fully understand structure/function relationships. 

The three adhesive classes of type 1 fimbriae identified 
above may understate the functional heterogeneity of type l 
fimbriae. The group of CIs that has been tested in this 
Example is small and only a few substances have been tested 
as potential receptors. A larger group of isolates tested 
against additional receptor candidates might yield additional 
functional classes. Preliminary studies with MS Bnterobacter 
aerogenes and Klebsiella pneumoniae strains exhibiting MF 
class and MFP class activity suggest that heterogeneous 
receptor specificities will also be found among other type l 
15 fimbriated enterobacterial species. 

It is also believed that it is possible that adhesins from 
some fimbriae responsible for mannose- resistant hemagglutina- 
tion or adhesion are structurally related to PimH, but with 
sequence alterations that eliminate sensitivity to mannose. 
The possibility that the MS lectin- like properties of FimH 
might be eliminated while retaining other adhesive properties 
of FimH (e.g. pellicle formation) has been shown previously 
in a study characterizing mutr- induced mutations in the flmH 
(pilE) gene (ref. 26). At the minimum, it is believed that 
tests for type 1 fimbriation should include additional func- 
tional characterization. While all type 1 fimbriae -mediated 
adhesion which have been described in this Example is 
mannose-sensitive, it is not all mannose- or even saccharide- 
specific as has commonly been thought. Further studies of 
type 1 fimbriae as a virulence factor must be able to distin- 
guish among the various functional classes. 
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Allelic variation of the so-called G adhesins of P fimbriated 
uropathogenic E. coli also results in different functional 
classes, but the requirement for the Galal-4Gal sequence 
35 within isoreceptors is maintained (refs. 52, 53). These 
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differences in G adhesin receptor specificity appear to be 
rather subtle, at least superficially, when compared to the 
differences in FimH receptor specificities. Yet there is 
significantly greater sequence homology among the fimH genes 
than among the G adhesin genes, some of which share less than 
50 percent homology. The G adhesin receptor specificities 
affect host susceptibility, due in large part to host-speci- 
fxc expression of glycolipid isoreceptor variants. Whether- 
the FimH family of adhesins bears a similar relationship to 
host susceptibility or tissue tropism remains to be deter- 
mined, in this regard it is possible that the G adhesin 
family could exhibit additional receptor specificities not 
restricted to the GalQrl-4Gal sequence. The lectin- independent 
affinity of P fimbriae for immobilized Fn is not dependent on 
the G adhesin, but on two other minor subunits, E and F 

neither of which bear significant homology to FimH (refs 56 
57) . ' ' 
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It IS important to point out that the degree of functional 
heterogeneity of type 1 fimbriae described in the present 
Example was not appreciated when any of the studies cited 
above were performed. The results of these studies have made 
xt clear that structural and functional heterogeneity occurs 
withm the class of adhesive organelles commonly referred to 
as MS or type 1 fimbriae and that the adhesive diversity will 
lead to a broader spectrum of receptive surfaces for poten- 
tial colonization. The surprising finding that a FimH family 
Of adhesins exists may prove to be an important step toward 
unravelling the role(s) type i fimbriae may play in the 
ability of enterobacteria to reach their normal habitat or 
gam entry into deeper tissues, where devastating effects can 
occur. 
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EXAMPLE 2 

Expression of ty pe 1 fimbriae in heterologous bacterial 
species 

The fim operon of E. coll comprises a cluster of genes 
5 covering about 8 kb of DNA. This operon has been isolated and 
cloned on plasmids in its entirety. The promoter upstream of 
the fimA gene is located within an invertible DNA sequence, 
which in E. coli leads to a switch on/switch off situation 
for fimbrial synthesis. In one orientation of the invertible 
10 sequence the promoter is directed towards the fim genes, and 
the cell is fimbriated; in the other orientation the promoter 
is directed in the opposite direction, and the cell is non- 
fimbriated. 

Since the regulation of the switch of the invertible promoter 
15 sequence is very complex and involves several genes outside 
the fim operon it is far from certain that the switching 
takes place in other bacteria than the enterics. It was 
therefore considered necessary to insert a replacement promo- 
ter for the expression of the fim genes, and as a model for 
2 0 gene expression in a number of different bacterial species 

the lac promoter was chosen. This promoter has been shown to 
be active and regulatable in many bacterial species. 

Plasmid pPKL83 is a derivative of pPKL4 (ref . 27) carrying 
the entire fim operon in pBR322, in which the promoter has 

25 been destroyed by inserting a Bg-IIl linker in the SnaB site 
located in the promoter sequence. There is a second BglTZ 
site in plasmid pPKL83 upstream of the fim operon (Fig. 6) . 
Plasmid pVLT33 (Fig. 7) is a kanamycin resistant derivative 
of the broad host range plasmid RSFlOlO, carrying the laclf^ 

30 gene and the tac promoter placed upstream of a multiple 

cloning site in which a unique SamHl site is placed. The two 
plasmids were ligated together after digestion of pPKL83 with 
Bglll and pVLT33 with BamHl, In one orientation (pSM1314) , 
this fusion plasmid will express fimbriae in the presence of 
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IPTG due to the fusion between the fimbrial genes and the lac 
promoter . 

The correct orientation of the fusion plasmid pSM1314 was 
verified by transforming it into a strain of E. coll which 
; carries a deletion of the fim operon. Production of fimbriae 
was assayed in two ways: 1) Cell aggregation with fimbrial 
antibodies and 2) ELISA assay of whole cells. The former 
analysis is rather sin^jle: to a small volume (10 ^tl) of an 
outgrown or IPTG- induced culture of the cells to be tested is 
added a small volume (2 ^1) of antibodies raised against 
fimbriae, on a glass slide. After mixing the samples, 
fimbriated cells begin to show cell aggregates which are 
easily observed directly as clumps or under a microscope. No 
clumping was observed with cells of the strain with a fim 
15 deletion, whereas pSM1314 transf ormants of this strain showed 
clearly detectable cell aggregates. The ELISA analysis of 
whole cells confirmed the aggregation assay. In Table 2 below 
the readings from this type of assay are presented, and they 
show quantitatively the occurrence of Fim antigens on the 
cells as a result of IPTG induction of the pSMl314 carrying 
strain. 
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Table 2. Results (dunlicatf»> of eltsa assay fn-r hypo i 
fimbria pxnregged b y pSM1314 in E. roli AAEn.gi (00^ 3-) 

AAEC191(pSM1314) 0.145/0.164 
25 AAECl91(pSM1314)+IPTG 1.026/1.260 

^^^^ 0.113/0.095 

Plasmid PSM1314 also carries a mob site which allows it to be 
transferred to other gram negative bacteria provided a helper 
plasmid is introduced. This type of transfer is most easily 
3 0 performed in triparental " matings in which a donor strain 

(E. coll carrying pSMl3i4) , a helper strain (E. coll carrying 
a plasmid with conjugation genes) and a recipient strain 
carrying a selectable marker not present in any of the two 



10 



15 



20 



25 



WO 95/20657 PCT/DK95/00042 

53 

other strains, are mixed on a plate (directly or on a fil- 
ter) . After some growth (often overnight) this mixture is 
spread on selective plates with antibiotics that only allow 
the recipient carrying the desired plasmid to grow and form 
5 colonies . 

In the present context, the E. cold, strains MClOOO (pSM13l4) 
and MClOOO (pRK2 013) and (as recipient) Enterobacter cloctcae 
strain A50 Nal"^ (ref. 67), were mated. This recipient strain 
is resistant to nalidixic acid. After selection for growth on 
plates with kanamycin plus nalidixic acid the resulting 
clones were grown in liquid medium and assayed for the pres- 
ence of fimbriae in the absence/presence of IPTG. The cell 
aggregation assay was employed. 



This assay showed that fimbriae were produced in the Enterob- 
acter cloacae strain and were present on the cell surface; 
however, full repression of expression from the tac promoter 
was not obtained, most likely due to an increased escape 
synthesis. The results showed that E. coli type 1 fimbriae 
may be synthesized and processed correctly for pili formation 
on the surfaces of heterologous gram-negative bacterial 
species . 

The plasmid pSM1314 in E. coli HBlOl was deposited on 26 
January 1994 with DSM, the Deutsche Sammlxing von Mikroor- 
ganismen und Zellkulturen GmbH, (German Collection of Micro- 
organisms and Cell Cultures) , Mascheroder Weg IB, D-38124 
Braunschweig, Germany, under the accession number DSM 8922. 



EXAMPLE 3 

The copstruction of f iiitff- f u s i on genes and ^he ^^ r-^^c-i^r, 
mannose- sensiti ve FimH fusinn nrotf^ing 

Heterologous sequences mimicing the pre-S2 region of the 
hepatitis B viral surface antigen and a neutralizing epitope 
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Of the cholera toxin B chain were inserted in two different 
positions in the FimH adhesin of type 1 fimbriae. This was 
carried out by introduction of restriction site handles 
(Bgrlll- sites) in the fimH gene, followed by in- frame inser- 
5 tion of heterologous DNA segments encoding the foreign epito- 
pes. In the selected positions such insertions did not sig- 
nificantly alter the adhesive fxinction of the FimH protein, 
since hosts producing hybrid fimbriae that contained the 
chimeric adhesins exhibited adhesion phenotypes and were 

0 normally fimbriated. The heterologous inserts of 52 and 15 
amino acids, respectively, residing in the chimeric FimH 
proteins were recognized by specific sera on the surface of 
the fimbriae on bacterial hosts. The results illustrate the 
possibility of using bacterial adhesins as general presenters 

5 of foreign antigens and epitopes. 

3.1. Materi als and methods 

3.1.1. Bacterial strain an d growth conditinna 

The Escherichia coli kl2 strain HBlOl was used in this study 
as a host for expression of chimeric fimbriae. This strain is 

1 phenotypically Fim" due to a deletion in the fim gene cluster 
(ref . 8) . Cells were grown on solid medium or in liquid broth 
supplemented with appropriate antibiotics. When required, 
gene expression from the lac promoter, residing in fronf of 
the fimH gene in plasmid pLPA22 and its derivatives, was 
induced by the addition of IPTG (isopropyl thiogalacto- 
pyranoside) to the growth medium. 

3.1.2. Pla smidg 

Plasmids PPKL4 (comprising the entire, functional fim gene 
cluster) and pPKL114 (comprising the fimH gene) have been 
described previously. 



PPKLIIS which is a plasmid containing the entire type 1 fim 
gene cluster with a stop linker insertion in the fimH gene 
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(i.e. this plasmid expresses all the proteins necessary for 
the production of fimbriae except the FimH protein) was 
constructed in two steps: 



(i) plasmid pPKL4 (refs. 27, 28) was digested with Kpnl which 
recognizes a unique restriction site in the fimH gene. The 
staggered end of the linearized plasmid was made blunt and 
ligated with the synthetic piece of DNA below (SEQ ID NO: 7) 
containing stop codons in all three reading frames, resulting 
in plasmid pPKL114: 



20 



10 5 ' - GTCGACTTAATTAATTAAGTCGAC - 3 ' 

3 ' - CAGCTGAATTAATTAATTCAGCTG- 5 ' ; 

(ii) the HlndXll-Eagl fragment from pPKLll4, containing the 
entire fim gene cluster with the inactivated fimH gene was 
subsequently inserted into the Hindlll and Ea^I sites of 
15 plasmid pACYC184, resulting in plasmid pPKLlis . 

Plasmid PSM782 (generously provided by S. Molin, Department 
of Microbiology. Technical University of Denmark, DK-2800 
Lyngby) containing the pre-S2 and S encoding regions of the 
hepatitis B viral genome, was made from plasmid X-HBVl (ref . 
72) by subcloning a EcoRI-Dral fragment into pBR322. 

Plasmid PLPA22 was constructed by inserting a 1018 bp PvuII- 
Mlul fragment containing the fimH gene from pPKL4 into 
plasmid PUC18. The insert was positioned downstream and in a 
expression compatible orientation to the lac promoter resid- 
ing on the vector part of the plasmid (Fig. 8) . Expression in 
E, coli HBlOi cells of functional FimH protein was monitored 
by complementing pLPA22 in trans with pPKLllS and testing for 
MS adhesion upon induction with IPTG. 

Plasmids PLPA29 and pLPA30 were made by inserting 9 -mer 
asymmetric Bglll-linkers into the BsaAI and HincII sites, 
respectively, in the fimH gene of plasmid pLPA22 . At six' 
different positions in the pLPA22 fimH gene a Bglii site was 



25 



30 




15 



WO 95/20657 PCT/DK9S/00042 

56 

introduced without changing the reading frame, resulting in 
plasmids PLPA98, PLPA36, pLPA58, pLPA30, pLPA29 and pLPA59 
(Fig. 10) . This was done either by inserting a Bglll linker 
into an appropriately treated restriction enzyme site, or by 
5 changing 1-3 basepairs using PGR and thereby creating a Bgill 
site. 

The plasmid pLPA36 was prepared by opening the pLPA22 fimH 
gene with the restriction enzyme rthllll and making the ends 
blunt using Klenow polymerase and ligating using an 8 mer 
10 BgrJli linker (SEQ id N0:8): 

5' -CAGATCTG-3' 
3' - GTCTAGA C-';' 

Plasmids PLPA58 and pLPA59 were made by Bg-lli site- creating 
site- directed mutagenesis of pLPA22 using standard PGR and 
plasmid PLPA98 was constructed by opening the fimH gene, 
making the ends blunt with T4 DNA polymerase and ligating 
with the below 10 mer B^lll linker (SEQ ID N0:9) : 

5 ' - GAAGATCTTC - 3 ' 
3 ' - C TTCTAGAA G- 5 ' 

Of the six resulting mutated fimH genes, three expressed 
protein that was integrated into type 1 fimbriae, and at the 
same time exhibited mannose- sensitive adhesion. Of these 
three mutated FimH proteins, the two that conferred to E. 
coli cells the strongest mannose- sensitive adhesion were 
expressed from plasmids pLPA29 and pLPA30 (Fig. 9) and these 
two plasmids were investigated further for their ability to 
contain large mutations and still be biological active. 

Plasmid PLPA29 has a 9 bp long symmetrical B^III linker 
inserted into the BsaAI site 66 bp upstream of the stop codon 
for the fimH gene, while plasmid pLPA30 has the same 9 bp 
Bgill linker inserted into the Hindi site 163 bp upstream of 
the stop codon of the fimH gene. 
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The plasmids pLPA37 and pLPA38 (Pig. 8) were constructed by 
inserting a 162 bp DNA fragment encoding the pre-S2 region of 
the Hepatitis B virus surface antigen into the Bglll sites in 
PLPA29 and PLPA30, respectively. This DNA fragment was crea- 
5 ted by a standard polymerase chain reaction (PCR) using the 
synthetic primers: (i) 5 ' -GGAGATCTAATTCCACAACCTT- 3 ' (SEQ ID 
NO:ll) and (ii) 5'-GGAGATCTGTTCAGCGCAGGGT-3' (SEQ IDN0:12), 
and plasmid pSM782 as a ten^late. 

A fragment of plasmid pLPA38 comprising the inserted 
10 heterologous sequence encoding the pre-S2 region of hepatitis 
B surface antigen is shown in the below table wherein the 
heterologous sequence is underlined and the numbers indicated 
correspond to the positions of the amino acid residues in the 
mature PimH protein. 

Bgn\ 

CAG TTC AGA TCT AAT TCC ACA ACC TTC CAC CAA ACT CTG CAA GAT 
Gln^ Phe Arg Ser Asn Ser Thr Thr Phe His Gin Thr Leu Gin Asp 

CCC AGA GTG AGA GGC CTG TAT TTC CCT GCT GGT GGC TCC AGT TCA 
Pro Arg Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser SeV 

GGA ACA GTA AAC CCT GTT CTG ACT ACT GCC TCT CCC TTA TCG TCA 
Gly Thr Val Asn Pro Va! Leu Thr Thr Ala Ser Pro Leu Ser Ser 



ATC TTC TCG AGG ATT GGG GAC CCT GCG CTG AAC AGA TCT TCG ACG 
lie Phe Ser Arg lie Gly Asp Pro Ala Leu Asn Arg Ser Ser Thr 



226 



15 The plasmids pLPASS and pLPA93 (Pig. 8) were then made by 
inserting the below 51 bp synthetic double stranded DNA 
segment encoding amino acids 50-64 (comprising an epitope) of 
the B subunit of the cholera toxin into the Bgli sites on 
PLPA30 and PLPA29, respectively (SEQ ID NO: 10): 
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5 ' - GATCTGTTGAAGTTCCGGGTAGTaVGCATATCGATAGTCAGAAAAAAGCTG - 3 ' 
3 ' - ACAACTTCAAGGCCCATCAGTCGTATAGCTATCAGT Cri ' 1 ' rri 'CGACCTAG - 5 ' 

A fragment of plasmid pLPA93 coit?)rising the heterologous 
sequence encoding the above DNA segment of the B subxinit of 
the cholera toxin is shovm in the below table wherein the 
heterologous sequence is underlined and the numbers indicated 
correspond to the positions of the amino acid residues in the 
mature FimH protein. 



Bg/ It 

ct^ 1°'^ l^^ ^"^ AGT CAG CAT ATC GAT ACT 

G^^ Phe Arg Ser Val Glu Val Pro Gly Ser Gin His He Asp SeT 



BamH MBgl II 
CAG AAA AAA GCT GGA TCT TCG ACG 
Gin Lys Lys Ala Gly Ser Ser Thr 
226 



3.1.3. DNA teghnioueg 

isolation of plasmid DNA was carried out according to the 
method of Bimboim and Doly (ref. 73). Restriction endonu- 
cleases were used according to the manufacturer's specifica- 
tions (Biolabs) . DNA sequencing was carried out by the di- 
deoxy chain termination technique (ref. 49) using a sequenase 
version 2.0 kit from USB. Oligonucleotides were made at the 
core facilities of the Department of Microbiology. Technical 
University of Denmark. 

3.1.4. PCR mprhodolooy 

Polymerase chain reactions (PCR) were performed on a Perkin 
Elmer Cetua DNA Thermal Cycler 480. Reactions were set up as 
100 Ml volumes containing 200 fOA each of dATP, dCTP, dGTP and 
dTTP, 0.2-1.0 MM of each Of the two primers, 2 mM MgCl^, 10 
mM Tris-HCl (pH 8.3), 50 mM KCl, 2.5 units of AmpliTaq DNA 
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polymerase and 0.1-0.2 /xg of plasmid template. The reactions 
were run for 25-30 cycles each consisting of i min. at 94<»C, 
1 min. at 40»C, and i min. at 72«C. For amplification of the 
pre-S2 fragment the above primers 5 ' GGAGATCTAATTCCACAACCTT 3' 
(SEQ ID NO: 11) and 5 ' GGAGATCTGTTCAGCGCAGGGT 3'(SEQ ID NO: 12) 
were used. 

3.1.5. Hemaaal utinatiQi; ! 

The capacity of bacteria to express a D-mannose binding 
phenotype was assayed by their ability to agglutinate guinea 
pig erythrocytes on glass slides. Aliguots of liquid cultures 
grown to an optical density of 3.0 and 5% erythrocytes were 
mixed, and the time until agglutination occurred was 
measured . 

3.1.6. Antiaera 

Rabbit anti-type 1 fimbria serum raised against purified type 
1 fimbriae has previously been described (ref. 74) . a 
monoclonal antibody directed against FimH (ref. 75) was 
kindly provided from Dr. Maryvonne Dho-Moulin, Institut 
National de la Recherche Agronomique, France. Goat serum 
raised against cholera toxin B subunit {international stan- 
dard for WHO No. 12-246) produced at the State Serum Insti- 
tute, Copenhagen, Denmark was kindly provided by same insti- 
tute. A monoclonal antibody directed against the pre-S2 
domain of Hepatitis B surface antigen (ref. 76) was kindly 
provided by Dr. Makoto Mayumi, Jichi Medical School, Japan. 
Fluorescein (PITC) conjugated anti rabbit, anti mouse, or 
anti goat sera were provided from Dakopats, Denmark. 

3.1.7. Fluorescpnce labe>1 1ina and CCD m-i ryoanopy 

Cells from overnight cultures (IPTG- induced, if required) 
were harvested, washed in PBS and fixed for lo minutes at 
room temperature in a 3.5% (w/v) solution of paraformaldehyde 
in PBS. Samples of 20 ^1 were placed on a poly-L-lysine 
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coated slide and air dried. After washing in PBS, 16 ^1 of a 
1:5 (monoclonal) or 1:25 (polyclonal) dilution of the primary 
antiserum was placed on top of each sample and left in a 
moist incubation chamber for l hour. The slides were washed 
three times in PBS and 16 /il of FITC conjugated antiserum 
were added. After two hours in the dark, the slides were 
washed three times in PBS and a drop of Citiflour (Citiflour 
Ltd., London, U.K.) was placed on top of each sample. For 
visualization, a Carl Zeiss Axioplan microscope equipped for 
epifluorescence and phase- contrast was employed. Using a 
charge- coupled device (CCD) camera, pictures were captured as 
12 -bit files with PMIS software (Photometries) and 
subsequently transferred to a Macintosh Quadra 950 computer 
for image analysis. 



15 3.1.8 



ElectT-nn microgrrjpy 



Electron microscopy and immuno- electron microscopy was 
carried out essentially as described previously (ref . 61) . In 
brief, a 25 /xl aliquot of bacterial suspension was placed on 
a carbon -coated, glow discharged grid for 30 seconds. Grids 
were washed in 2 drops of PBS, dehydrated for 5 min in each 
of the following concentrations of ethanol: 25%, 50%, 75% and 
96%, blotted dry and shadowed with tungsten wire at an angle 
of 30°. For immuno- electron microscopy a monoclonal antibody 
directed against the pre-S2 region was used diluted 1:5 as 
the primary antibody and rabbit anti -mouse serum conjugated 
with 10 nm gold particles (Dako) was used in dilution 1:20 as 
the secondary antibody. 



3.2. R<=»H^1^^q 

As described above, two positions in the C- terminal part of 
the FimH protein were engineered to contain heterologous 
sequences mimicing foreign antigenic determinants. In the 
present study, double plasmid systems were used. In each 
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plasmid pair one encoded either a wild- type or an engineered 
version of the fimH gene, whereas the second plasmid encoded 
auxiliary functions such as the two- component Fim- specific 
transport system, regulatory genes and other structural 
components of the fimbrial organelle except FimH (Table 3) . 

3,2.1. Engineering new res trict inn siteg into fimW. 

Based on algorithms for prediction of such parameters as 
hydrophilicity and secondary structure, two potentially 
optimal positions for insertions of heterologous sequences in 
the C- terminal domain of the FimH protein were selected. 
These correspond to positions 225 and 258 in the mature 
protein predicted to be situated in a surf ace -exposed part of 
the FimH protein. In order to facilitate later manipulations, 
15 the fimH gene was subcloned into the pUC18 vector resulting 
in plasmid pLPA22. Subsequently a S^III site was introduced 
in- frame into positions 225 and 258, respectively. This was 
carried out by site -directed mutagenesis employing synthetic 
oligomers resulting in plasmids pLPA30 and pLPA29, respect - 
20 ively (Fig. 9) . 



The introduced B^JII sites resulted in a codon change from a 
Leu to a Phe codon in position 225 and addition of codons for 
the sequence Arg-Ser-Ser, in the case of plasmid pLPA29, and 
addition of codons for the sequence Arg-Ser-Gly after posi- 
25 tion 258 in the case of plasmid pLPA30. Sequence analysis of 
the entire modified fimH genes in plasmids pLPA29 and pLPA30 
confirmed that no other changes had occurred. Host cells 
which in addition to plasmid pLPA29 or pLPABO also contained 
plasmid PPKL115 {fimH). showed wild-type phenotypic charac- 
teristics with regard to adhesion and fimbriation as judged 
by such criteria as hemagglutination (Table 3) and immuno- 
fluorescence microscopy. 
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3.2.3. Engineering h e terologous DNA-secruenceg encoding the 
pre-S2 domain of hepatitis B aiir-f ace antigen and fj. cholera 
toxin end tone into fYmff, 



As heterologous reporter epitopes the pre-S2 region of the 
5 hepatitis B surface antigen and a well characterized region 
of the B siibunit of cholera toxin were selected. The pre-S2 
region is known to contain immunologically important (and 
protective) antigenic determinants (ref. 76). in addition, 
this region is disulphide bond- independent and apparently 
0 more immunogenic than the major S protein. The cholera toxin 
segment consists of residues 50-64 of the B subunit and has 
previously been shown to elicit antibodies that bind to and 
neutralize cholera toxin (ref. 77) . 



15 A DNA segment of 162 nucleotides encoding 52 of the 55 amino 
acids of the pre-S2 region was amplified by PGR technology 
using plasmid pSM782 as ten5)late and primers that provided 
the amplified sequence with flanking BglZZ sites. Following 
restriction with Bgizx and purification the amplified frag- 
ment was inserted into the Bglll sites of plasmids pLPA29 and 
PLPA30 resulting in plasmids pLPA37 and pLPA38, respectively 
(Fig 9). Subsequent sequence analysis confirmed that the 
inserts were correctly oriented and that the reading frame of 
the chimeric fiinH-pre-S2 genes was correct. 
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A synthetic DNA segment encoding the cholera epitope was made 
by annealing two complementary 51 bp oligonucleotides which 
were designed to result in a double stranded DNA fragment 
with a Bgliz overhang in one end, a BairiHI overhang in the 
other and an internal Clal site. The epitope- encoding segment 
was inserted into the B^III site in the flmH gene in plasmids 
PLPA29 and pLPA30, resulting in regeneration of a BglZT site 
at only one end of the insert. This feature was used to 
identify plasmids with correct orientation of the insert. The 
presence of the Clal site was used for initial screening for 
clones containing the insert. Sequence analysis of plasmid 
PLPA93 and PLPA95, both harbouring the epitope -encoding 
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segment confirmed the orientation and conservation of the 
reading frame in the chimeric fimH- cholera genes (Fig 8) . 

3.2.4. Expreggion of chimeric FimH adhesin comprising as 
heterologous seouenceg the pre-S2 domain of hepatitis B 
surface antig en and a cholera toxin epitope. 

To evaluate whether the heterologous inserts in fimH were 
compatible with protein expression the T7 polymerase/promoter 
system of Tabor and Richardson (ref. 78) was used. Subcloning 
into the pGEMB vector system and subsequent assaying revealed 
that proteins with the expected sizeis were produced in all 
cases from the chimeric fimH genes. More inportantly, to 
assess whether the FimH proteins harbouring foreign inserts 
were accepted by the type l fimbrial transport system and 
additionally, whether they were present on the bacterial 
surface in a biologically functional form, the adhesion ' 
phenotype of recombinant strains expressing the chimeric FimH 
proteins was studied. 



Bacterial hosts which in addition to plasmid pLPA38 (pre-S2 
insert in position 225 in FimH) also contained plasmid 
pPKLliS {fimH) gave, when induced by IPTG, good agglutinati 
of guinea-pig erythrocytes indicating the presence of a 
biologically active form of the FimH adhesin on the cells 
(Table 3) . The combination of plasmids pLPA37 (pre-S2 in 
position 258 in FimH) and pPKLllS resulted in weaker, but 
detectable, hemagglutination (Table 3). Furthermore, such 
cells were also shown by electron microscopy to have essen- 
tially normal fimbriation (Fig. lO) . 
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Table 3. Genoti vpe and phenotype of plasmids (A. B or U, 
respectively indicate PA CYC184. nBR322 or pUCIS based vector) ' 
used in this study, position of inserts and hemagglutination 
titer. Host cell: g, coli HBIQI 

Plasmid relevant genotype insert hemagglu- 

position tination^ 



pPKIj4 (B) 


all fxm genes 




15 


pPKLllS (A) 


fxTva 




>600 


pLPA22 (U) 






>600 


pLPA29 (U) 


fimff-Bglll 


258 


>600 


pLPA30 (U) 


fijnff-Bglll 


225 


>600 


pLPA37 (U) 


fijnfr-pre-S2 


258 


>600 


pLPA38 (U) 


fiinff-pre-S2 


225 


>600 


PLPA93 (U) 


fimH- cholera 


225 


>600 


pLPA95 (U) 


fimH- cholera 


258 


>600 


PLPA22 (U) 
+PPKL115 (A) 


fimH*' 
fimH 




10 


pLPA29 
+PPKL115 


fimH-Bglll 
fimH 




7 


pLPA3 0 
+PPKL115 


fimH-Bglll 
fimH 




8 


PLPA3 7 
+PPKL115 


fimH- pre -S2 
fimH 




210 


pLPA38 
+PPKL115 


f imH- pre- S2 
fimH 




100 


PLPA93 
+PPKL115 


fimH- cholera 
fimH 




11 


PLPA95 
+PPKL115 


fimH- cholera 
fimH 




1 a 



a) Hemagglutination of guinea-pig erythrocytes indicated in 
seconds before reaction occurred. The average values of 4 
measurements are given . 

In the cases where a sequence mimicing a cholera epitope had 
been inserted into FimH, viz. pLPA93 (insert in position 225) 
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and PLPA95 (insert in position 258) , respectively, an agglu- 
tination phenotype also resulted when either of these plas- 
mids were complemented by plasmid pPKL115 (flmH) (Table.3). 
Again, this suggested that in spite of the presence of 
foreign peptide segments the chimeric FimH proteins were 
still able to reach the bacterial surface and maintain its 
adhesive function. In addition to the adherence phenotypes of 
the various clones the presence of engineered FimH adhesins 
on the surface of the cells were monitored by CCD microscopy 
in connection with fluorescent antibody methodology employing 
a FimH- specific monoclonal serum. In all cases, significant 
signals, albeit of varying intensity, were detected when 
compared to a negative control strain that harboured the 
auxiliary plasmid, pPKL115, alone. 

3-2.5. Immunological det^^rflon of ^h e pre -52 geoment nf , 
hepatitis B mirface anrioen anrt the chQl«:>ra tny-in epitope, in 
chimeric F imH adhesins. 

Since there was good evidence that the chimeric FimH proteins 
were present on the surface of the E. coll hosts the ability 
of specific antisera, raised against the pre-S2 part of - 
hepatitis B surface antigen or the cholera toxin B chain, 
respectively to recognize the chimeric FimH-pre-S2 and FimH- 
cholera proteins directly on the surface of the recombinant 
bacteria were tested. By immunofluorescence microscopy E. 
coll hosts harbouring either of plasmids pLPA37 or pLPA38 in 
addition to plasmid pPKLii5 were shown to react specifically 
with antisera directed against the inserted heterologous 
sequence, whereas hosts expressing wild- type FimH did not. 
Similar results were obtained with the cholera toxin insert 
in the same positions (plasmids pLPA93/pPKLll5 and 
pLPA95/pPKLll5) . Again, the heterologous inserts in the 
chimeric FimH proteins were recognized by insert -specific 
serum on the bacterial surface, whereas the relevant control 
did not react. 
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These findings demonstrate that the foreign epitopes are 
exposed on the surface of extracellularily located chimeric 
PimH proteins and, significantly, in a conformation which 
mimics the natural conformation of the epitope (s) as it 
5 appears in the native hepatitis B surface antigen or the 
native cholera toxin. 

The results obtained by immunofluorescence microscopy were 
corroborated by immune -electron microscopy, employing the 
pre-S2 specific monoclonal antibody as primary serum and a 

0 colloid gold- labelled secondary antiserum. A significant 

amount Of gold particles were seen, mostly in connection with 
the fxmbrial organelles, on bacterial hosts harbouring chim- 
eric fiinH-pre-S2 genes (Fig. lOb and 10c), whereas only few 
goldparticles were present on the control strain expressing 

5 wxld-type fimbriae (Fig. lOa) . Furthermore, in the latter 
case the gold-particles were not seen to be associated with 
the fimbriae. 

The plasmids pLPA22, pLPA29, pLPA30, pLPA37, pLPA38, pLPA93 
PLPA95 and PPKL115 in E. call HBlOl were deposited on 26 
> January 1994 with DSM, the Deutsche Sammlung von Mikro- 
organismen und Zellkulturen GmbH, (German Collection of 
Microorganisms and Cell Cultures), Mascheroder Weg IB, D- 
38124 Braunschweig, Germany, under the accession numbers DSM 
8915, DSM 8916, DSM 8917, DSM 8918, DSM 8919. DSM 8920, DSM 
8921 and DSM 8923, respectively. 



EXAMPLE 4 



Binding of fhe MFP riass adhe.in rr ^o^T 5 0 t-,n 

thetic pept:id<ag 

14 synthetic peptides were synthesized on an ABI automated 
peptide synthesizer according to the method of Merrifield 
(Merrifield, R.b. 1963. Solid phase peptide synthesis. I. The 
synthesis of tetrapeptide. J. Am. Chem. Soc. 85:2149) The 
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binding of the E. coli strain CSH-50 to these peptides were 
tested essentially as described in Exan^jle 1, The results of 
these binding assays indicated that this MFP class straiii 
adhered strongly to one group of peptides whereas the binding 
the an other group of peptides was absent or weak. In the 
below listing the one- letter code sequences of the synthetic 
peptides are shown in a + group, i.e. the group of peptides 
to which the tested strain adhered strongly, and a - group of 
peptides to which the binding was weak or absent: 



+ group of pen^irif^g 



FnSPl: EAQQMVQPQSPVAVSQSKPGCYDNGKHYQI (SEQ ID NO: 13) 

CB-II-G: EEGKRGARGEBGAAGPVGPBGERGARGNR (SEQ ID NO: 14) 

sMl(19-32): AIQNIRLRHENKDL (SEQ ID NO: 15) 

SM6 (1 - 11 ) : RVFPRGTVENPC ( SEQ ID NO : 16 ) 

sM12(l-12): DHSDLVAEKQRLC (SEQ ID NO: 17) 

sM12 (7-18): AEKQRLEDLGQKC ( SEQ ID NO : 1 8 ) 

SM5 (175-184) : TVKDKLAKEQC (SEQ ID NO:19) 

sMS ( 2 8 - 5 4 ) : KTKNEGLKTENEGLKTENEGLKTENEGC ( S EQ ID NO : 2 0 ) 
- grou p of peptides 

SM5 (134-163): QESKENEKALNELLEKTVKDKIAKEQENKE (SEQ ID NO:21) 
SM5 (117-146) : DLTKELNKTRQELANKQQESKENEKALNEL (SEQ ID N0:22) 
KEAIiDKYELENHD (SEQ ID NO: 23) 
DVENSMLQAN (SEQ ID NO: 24) 

LKTEKSNLERKTAELTSEKKEHEAENDKLKC (SEQ ID NO: 25) 
sM24 (289-303) :HQKLEEQNKTSEASRC (SEQ ID NO:26) 



SM5 (14-26) 
SM6 (22-31) 
SM5 (55-84) 



EXAMPLE 5 

FimH adhesin nf furthgr- cliniral isolates 

The following clinical isolates of E. coll were tested for 
adhesion class according to the methods described in Example 
1: KB-23, KS-54, U221-3, MJ#9-3, MJ#31-3, MJ#ll-2, MJ#2-2. 
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The results of these experiments are illustrated in Fig, 5, 
As explained above, the isolate KB -23 showed the type of 
adhesion, and the isolate U221-3 expressed a M class adhesin 
showing a mannose- resistant type of adhesion and accordingly, 
this strain was classified as having a m'* class adhesin. 
The amino acid sequences of these clinical isolates are shown 
in Fig. 5 and their nucleotide sequences in Table 5 below. 

Table 5 shows the nucleotide sequences of the fimH genes of 
selected fimH genes disclosed in Example 1 [CI#3 (SEQ ID 
NO:50), CI#4 (SEQ ID N0:44) , CI#7 (SEQ ID N0:51), CI#10 (SEQ 
ID NO: 48) and CI#12 (SEQ ID NO: 54)] and as the reference that 
of the E. coll K12 strain PC31 as it was originally disclosed 
by Klemm et al. (ref . 27) as the top sequence designated 
PC3la and the sequence as it was determined recently (PC3lb) . 
15 Additionally, the nucleotide sequences of the following 

clinical isolates of E. coli are shown: KS54 (SEQ ID NO:52), 
U221-3 (SEQ ID NO:53), MJ#9-3 (SEQ ID N0:46), MJ#31-3 (SEQ ID 
N0:47), MJ#ll-2 (SEQ ID NO:43), MJ#2-2 (SEQ ID N0:45) and F- 
18 (SEQ ID N0:42) . 
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Table 5. Nucleotide sequences of the above fzmH genes dis- 
closed in Example 1, E. coli K12 strain PC31 (PC31a and 
PC31b) and the nucleotide sequences of KS54, U221-3, MJ#9-3, 
MJtt31-3, MJ#11.2, MJ#2-2 and F-18. 



1094 



™ !!! *!! !f f f If !!! f^* ^gc tgg tog gta aat 



r-18 

MJll-2 
CI 4 
MJ22 
MJ9-3 
KJ31-3 
CI 10 

pcaib 

CI 3 
CI 7 
1CS54 
0221-3 
Cl 12 

1148 

PC31a 
F-18 
HJll-2 
CI i 

MJ22 



1148 

!!! !^ !!! !f f !!! ^ ^ f f f ^ ^ 



1202 

~ „ !!! ™! !!!^ ^" ^cc gtc gtg aat gtg ggg ova aac 



PC31« 

F-18 
MJll-2 
CI 4 



MJ9-3 T — 

MJ31-3 T — 

CI 10 



CI 7 ~ 211 

KS54 X Z III _ Z Z^Z ~ C — 

U221-3 ZZ 

CI 12 



-T -C- 



Jl 
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Table 5, continued 



1256 

P^e * ~ ~! !!a ~! 11 !f ! til ^ "^"^ GAA ACC 



-C 



1312 



CI 4 
MJ22 



CI 7 
KSS4 

U221-3 
CI 12 



1366 



r "J* !S! in Iff f !f *ff f!* ^ AGT AGC TAT CCA TTT CCT ACC ACC 

MJll-2 -G ^ T T 

CI 4 -G A t ~^ T 

MJ22 -G I~ * — <^ T 

MJ9-3 * T — C T 

MJ31-3 I~ ~I Z~ ~2 C — G — T 

U221-3 HI z~ ~i "I i~ z~ z~ 2IZ ® """^ — 
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Table 5, continued 



r^^Ji* ^ !!!^ !f ! !!! !!! f!! 'J!^ ^ !f f ff^ iff tgc ccg gtg 



1420 



MJll-2 

CI 4 

MJ22 

MJ9-3 

MJ31-3 

CI 10 

raib 

CI 3 — T - 

CI 7 

1CSS4 

0221-3 

CI 12 



1474 



PC31a 

F-18 

MJll-2 

CI 4 

HJ22 

MJ9-3 

HJ31-3 

CI 10 

PC31b 

CI 3 

CI 7 

KSS4 

t7221-3 

CI 12 



1526 



!^ !!!^ *!! !ff !!! fZ! *!! Hf f acc ^ ^ ^ oat gat 
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Table 5, continued 



1582 

EL !H ^ ~ :r :~ ~ ~ ~- ~ ™ ^ ?!? .=5 - 



MJ22 
MJ9-3 
MJ31-3 
CI xo 
PC31b 

CI 3 

CI 7 

KS54 

U221-3 

CI 12 



P-18 


1636 
OOC TGC 


AJll-2 




CI 4 




KJ22 




MJ9-3 




HJ31-3 




CI 10 




PC3lb 




CI 3 




CI 7 




ICS54 




U221-3 




CI 12 




PC31« 

F-ie 

MJll-2 


1690 

TCA GTG C 



CI 4 
NJ22 
MJ9-3 
MJ31-3 
CI 10 
PC31b 
CI 3 
CI 7 
K554 
U221-3 
CI 12 
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Table 5, continued 



1744 

iTlT -~ "~ Iff fff ^ f^f ff^^ f^J fCG GGC AAC TCG ATT TTC ACC AAT ACC GCG 



0221-3 
CI 12 

pcaia 



1798 



F-18 " !ff!!!!!!^f"!fJ^!f!!fffff3*f^J3f^CGCAACGGTACOATT 

MJ22 
MJ9-3 
KJ31-3 
CI 10 
K31b 



-C — G 



CI 7 

ICS54 

U221-3 

18S2 

r^ll" ™ !ff ^! ^ f!* fff GCA GTA GGG ACT TCG GCG GTG AGT 
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Table 5, continued 



X906 



MJll-2 

CI 4 

MJ22 

MJ9-3 

MJ31-3 

CI 10 T— 

9C31b 

CI 3 

CI 7 

ICS54 

U221-3 

CI 12 



PC31a 

F-18 
KJll-2 
CI 4 
MJ22 
M79-3 
HJ31-3 
CI 10 
PC3U> 
CI 3 
CI 7 
KS54 
0221-3 
CI 12 



1960 

GTG CAA TCG ATT ATT GGC GTG ACT TTT GTT TAT CAA 



10 



15 



20 



25 



30 



WO 95/20657 PCT/DK95/dd042 

75 

EXAMPLE 6 

Enrichment spI ection of strains having mutated PimH adhf^aing 
conferring ^^1 tared adhesion ability 

One mechanism whereby new binding activities of bacterial 
adhesins may arise is by random, naturally occurring mutage- 
nesis. In nature, a variety of factors would enrich for 
strains that possessed adhesive capacities conferring a 
selective advantage. In the present example an in vitro 
procedure was used to select for potential mutants with 
altered adhesive capacity. As a target substratum bovine k- 
casein was selected. 

casein is the glycosylated isoform of bovine casein con- 
sists of a single polypeptide chain containing 169 amino acid 
residues the sequence of which has been determined (ref, 68). 
Bovine /c- casein does not contain N-glycosidic linkages, but 
up to six 0-linked oligosaccharides are present in the C- 
terminal region of the molecule (refs. 68, 69) . The sacchari- 
de moieties are heterologous and also vary as a function of 
time after parturition. Of significance for the present study 
IS the fact that D-mannose is not present in the bovine k- 
casein. Only di- to hexasaccharides containing galactose, N- 
acetyl-galactosamine, N- acetyl -glucosamine, fucose and sialic 
acid have been described (ref. 68). Glycoproteins having such 
saccharide compositions would not be expected to serve as a 
receptor for the classic type of the FimH adhesin such as is 
found in E. coli strain PC31. 

Adhesion tests were performed to verify the inability of 
recombinant strains carrying the fimH gene from B. coli 
strain PC31 to adhere to immobilized /c-casein. The s. coli 
strain used, KBlOOl is HBlOl containing plasmids pPKLllS and 
PLPA22 (ref. 70). The adhesion assay was performed using 
microtiter plates coated with 30 /xg/ml *-casein in O.l M 
sodium bicarbonate (pH 9.6) for 30 minutes, followed by 
blocking any remaining binding sites with a subsequent in- 
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cubation with 0.1% bovine serum albumin (BSA) in PBS. A 
quantitative adhesin assay was performed as described in more 
detail elsewhere (ref. 71). Briefly, bacteria were diluted to 
equivalent concentrations (5 x lO'^ cells/lOOAil) in PBS 
5 containing 0.1% BSA, added to coated microtiter wells for 30 
minutes at 37oc. After washing the wells thoroughly to remove 
unbound bacteria, BHI broth was added and the bacteria were 
allowed to grow at 37«C on a rotating platform (150 rpm) 
until the optical density could be measured (2-2.5 hours). 
10 Comparisons can be made of optical densities obtained in the 
test wells to those obtained in standard curves developed 
from the plating of known numbers of bacteria under similar 
conditions, allowing extrapolation to absolute numbers of 
bound bacteria (ref. 70) . 



20 



15 The KBiOOl strain comprising the f±mH gene from PC31 bound to 
immobilized mannsm in significant numbers, but there was 
substantially no measurable adhesion to immobilized /t- casein. 
To select for possible mutant cells having acquired the 
ability to bind to k- casein, cells of KBlOOl were allowed to 
interact with «- casein immobilized on microtiter wells. After 
thorough washing to remove non- adhering bacterial cells, 
cells adhering to the wells were collected and grown over- 
night in BHI broth. These "enriched" bacterial cultures were 
again allowed to interact with immobilized casein, the 
plates were washed and adhering cells collected in nutrient 
broth. This enrichment cycle was repeated up to ten times. 
Bacterial cells obtained from the last of these cycles ("en- 
riched" strains) adhered to k- casein in significantly in- 
creased numbers in comparison to the parent ( "non -enriched " ) 
strain (Table 6.1). Individual colonies of "enriched" KBlOOl 
were isolated and four tested for ability to adhere to k- 
casein. Three enriched cultures (clones) bound to k- casein 
significantly better than did the non- enriched parent strain. 



25 



30 



5 
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Table 6,1. Adhesion to caap»i n of non-enriched and enrichgri P; 
coli strain WBi nm 



Strain bacteria binding to 

K- casein*^ 

Non-enriched KBIOOI 0.043 + 0.018 

(pPKLllS + pLPA22) 

Enriched KBlOOl o.249 ± 0.004 

(pPKLllS + PLPA22) 

a) numbers represent optical density of bacterial growth ± 
S.D. with background O.D. substracted. N = 3. 

To determine whether the new adhesive activity was due to 
plasmid -related changes and not siinply to host cell -related 
changes, plasmid preparations of pLPA22 were made from en- 
riched and from non- enriched strains and used to transform E. 
coli HBlOl containing the auxiliary plasmid pPKLllS . Randomly 
selected transf ormants resistant to ampicillin and chloram- 
phenicol were tested for adhesion to casein, and several of 
the transf ormants harbouring plasmids from enriched cultures 
adhered in significantly increased numbers relative to plas- 
mid -containing cells of the non-enriched parent strain (Table 
6.2) . 

Table 6.2. Adhesion to casein of mi n i fnPKLUs) transf nr-moH 
with plasmids from enT-i r^h g d or nnn-enriched strain yRinm 

Plasmid derived from: bacteria binding to 

If -casein^* 

Non-enriched KBlOOl 5 + 0.1 x 10^ 

Enriched KBlOOl 50 + 1.5 x 10^ 



wo 95/20657 



m 



PCT/DK95/00042 



78 



a) numbers represent mean niimber of bacteria per well + S.D. 



The above results demonstrate that random or spontaneous 
mutations in genes coding for a bacterial adhesin that confer 
5 binding to a new substratum (i.e- a receptor moiety to which 
the parent strain does not bind) , can be selected for by 
appropriate in vitro procedures. 



N = 3. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc 136£j) 



A. Ibe indicatkins made below relate to the microorgmism refened to in the description 
ottp»g<^ .line 



B. IDENTIFICATION OF DEPOSIT 



Furtber deposits are identiGed oo an additsona} sbeet pT] 



Name of depositary institution 

DSM-Oeutsche Sammlung von Mikroorganismen und Zellkulturen GmbH 



Address of depositary institution futtMuUng postai code end cctuary) 

Mascheroder Weg IB 
D-381 24 Braunschweig 
Germany 



Date of deposit 




Aeoessioo Number 


26 January 1994 




DSM 8922 


C ADDITIONAL INDICATIONS (iem 


vc bLud: ifiua tppGeabU) This information is continued on an additional sbeet 



^^^^^^f**f ^® respective Patent Offices of the respective desio- 
MiSS''^^' "^""^ applicants request that a sample of thl depSii- 
bv tSi Jo "'^^^ available to an expert nominated 

tL^Ii-f '^^^^^''v,""5'-^ "^^^^ °" paten? is granted or 

S^X^Id^Jo^SrSiSSraSS!'"''^" "^"^^^ - wLhdrawn°L 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE P/.*.i^«,V»» 



E. SEPARATE FURNISHING OF^INDICATIONS (Ica^bia^ifnd.ppllcMc) 



lOttstLg^ 'Accession 



For receiving Oflice use only 



This sbeet was received with the iniemational appl; 



licaiion 



Authorized oQjccr 




For Iniemational Bureau use only 
I I Tbis sheet was received by the international 



Bureau i 



Authorized oflicer 
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INDICATIONS RELATING TO DEPOSITED MICROORGANISMS 

(PCT Rule 12jbis) 

Additional sheet 

In addition to the microorganism indicated on page '53 of the 
description, the following microorganisms have been deposited 
with 

DSM-Deutsche Sammlung von Mikroorganismen und 
Cellkulturen GmbH 

Mascheroder Weg lb, D-38124 Braunschweig, Germany 
on the dates and under the accession numbers as stated below: 

Accession Date of Description Description 

number deposit Page No. Line No. 



10 



15 



20 



25 



30 



DSM 


8915 


26 


January 


1994 


66 


24 


DSM 


8916 


26 


January 


1994 


66 


24 


DSM 


8917 


26 


J anuary 


1994 


66 


24 , 


DSM 


8918 


26 


January 


1994 


66 


24 i 


DSM 


8919 


26 


January 


1994 


66 


24 


DSM 


8920 


26 


January 


1994 


66 


24 


DSM 


8921 


26 


January 


1994 


66 


25 


DSM 


8923 


26 


January 


1994 


66 


25 



For all of the above- identified deposited microorganisms, the 
£ollowing additional indications apply: 

As regards the respective Patent Offices of the respective 
designated states, the applicants request that a sample of 
the deposited microorganisms stated above only be made 
available to an expert nominated by the requester until the 
date on which the patent is granted or the date on which the 
application has been refused or withdrawn or is deemed to be 
withdrawn. 
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SEQDKNCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: GX BioSystems A/S 
<B) STREET: Mothsvej 70 

(C) CITY: Holte 

(D) COUNTRY: Denmark 

(E) POSTAL CODE (ZIP) : 2840 

(ii) TITLE OF INVENTION: Receptor specific bacterial adhesins and 
their use 

(iii) NUMBER OF SEQUENCES: 55 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC con^atible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
" 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 

105 110 
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Arg Val Val Tyx Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tjrr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 140 

Ser Leu lie Ala Val Leu lie Leu ArgfGln Thr Asn Asn Tyr Asn Ser 
145 150 7 155 160 

Asp Asp Phe Gin Phe Val Trp Asn^tie Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro lie Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser lie Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr lie lie Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unJcnown 

(D) TOPOLOGY: linear 

/ 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Glu Ala Gin Gin Met Val Gin Pro Gin Ser Pro Val Ala Val Ser Gin 
1 5 10 15 

Ser Lys Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr Gin lie 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
GGGGGGTGCA CACCTACAGC TCAACCCGG 
(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
GGGGGTGCAC TCAGGGAACC ATTCAGGCA 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGTGCGCAT TATTGATAAA CAAAAGTCAC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGGCATGCT TATTGATAAA CAAAAGTCAC 
(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOIiOGY : 1 inear 

(ii) MOLECULE TYPE: DMA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTCGACTTAA TTAATTAAGT CGAC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 8 : 
CAGATCTG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAAGATCTTC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



10 
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(D) TOPOIOGY: linear 
(ii) MOLECDLE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GATCCAGCTT TTTrCTGACT ATCGATATGC TGACTACCCG GAACTTCAAC A 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGAGATCTAA TTCCACAACC TT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGAGATCTGT TCAGCGCAGG GT 

22 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Glu Ala Gin Gin Met Val Gin Pro Gin Ser Pro Val Ala Val Ser Gin 



5 10 X5 
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Ser Lye Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr Gin lie 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: sunino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Asx Gly Ala Ala Gly Pro 
15 10 15 

Val Gly Pro Asx Gly Glu Arg Gly Ala Arg Gly Asn Arg 
20 25 

<2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: aunino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala lie Gin Asn lie Arg Leu Arg His Glu Asn Lys Asp Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Arg Val Phe Pro Arg Gly Thr Val Glu Asn Pro Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : unknown 
(D) TOPOIiCX3Y: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asp His Ser Asp Leu Val Ala Glu Lye Gin Arg Leu Cys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) I^DLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ala Glu Lys Gin Arg Leu Glu Asp Leu Gly Gin Lys Cys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 sunino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Val Lys Asp Lys Leu Ala Lys Glu Gin Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: cunino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 
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(ii) MOLKCDLE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Lys Thr Lys Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr 

1 5 in -1 



5 10 

Glu Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



15 



(xi) SEQUENCE DESCRIPTION: SEQ i: 

Gin Glu Ser Lye Glu Asn Glu Lys 
1 5 

Thr Val Lys Asp Lys lie Ala Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



> NO: 21: 

Ala Leu Asn Glu Leu Leu Glu Lys 
10 15 

Glu Gin Glu Asn Lys Glu 
25 30 



(xi) SEQUENCE DESCRIPTION: SEQ I 

Asp Leu Thr Lys Glu Leu Asn Lys 
1 5 

Gin Gin Glu Ser Lys Glu Asn Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: xinknown 



) NO:22: 

Thr Arg Gin Glu Leu Ala Asn Lys 
10 15 

Lys Ala Leu Asn Glu Leu 
25 30 
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(D) TOPOLOGY: linear 
(ii) MOIiECDLE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
Lys Glu Ala Leu Asp Lys Tyr Glu Leu Glu Asn His Asp 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
Asp Val Glu Asn Ser Met Leu Gin Ala Asn 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Leu Lys Thr Glu Lys Ser Asn Leu Glu Arg Lys Thr Ala Glu Leu Thr 
5 10 15 

Ser Glu Lys Lys Glu His Glu Ala Glu Asn Asp Lys Leu Lye Cys 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

His Gin Lys Leu Glu Glu Gin Asn Lys Thr Ser Glu Ala Ser Arg Cys 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
" 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 

150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 



i 
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195 200 205 

Ala Lys Ser Gin Aen Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 

230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 aniino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Aen Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 

55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
" 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val 
Q5 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 

105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 
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Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lye Ala Gly 
130 135 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Cys Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SBQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
15 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 
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Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 

70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Glu Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 3^4Q 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 

150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
1€5 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 

200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr Asp Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 

230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

!) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : untoown 
(D) TOPOLOGY: linear 

(ii) IfiDLECDLE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
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Met Lys Arg Val lie Asn Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
15 10 15 

Val Asn Ala Trp Ser Phe Ala Cye Lys Thr Ala Asn Gly Thr Ala lie 
20 25 30 

Pro lie Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin His Leu Val Val Asp Leu Ser Thr Gin lie Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr lie Thr Asp Tyr Val Thr Leu Gin 
^5 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val I-eu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Leu 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 3_4Q 

Ser Leu lie Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 

150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
IQO 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 31: 
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(i) SEQDBNCB CHARACTBRISTICS : 

(A) LENGTH: 296 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknovm 
<D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 go 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
^5 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Lys Ala Gly Ser Leu He Ala 
130 135 



Val Leu He Leu Arg Gin Thr Asn 



Asn Tyr Asn Ser Asp Asp Phe Gin 



150 155 160 

Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly 
165 170 175 

Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr 
180 185 190 

Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys Ala Lys Ser Gin 
195 200 205 

Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser 

215 220 

He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin Gly Val Gly Val 

230 235 240 

Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn Asn Thr Val Ser 
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245 250 255 

Leu Gly Ala Val Gly Thr Ser Ala Val Ser lieu Gly Leu Thr Ala Asn 
260 265 270 

Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Aen Val Gin Ser He 
275 280 285 

He Gly Val Thr Phe Val Tyr Gin 
290 295 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
15 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Arg Gin 
^5 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
2-00 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Glv 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
"5 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 
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Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro lie Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser lie Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr lie lie Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser lie lie Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : un3cnown 

(D) TOPOLOGY: linear 

(ii) M5LECDLE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Lys Arg Val He Asn Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys Hie Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 

70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys. Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 
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Arg Val Val Tyr Asn Ser Arg Ttir Asp Lye Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Val He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser ^ 

150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 
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Val Asn Val Gly Gin Asn Leu Val Val Asp l.eu Ser Thr Gin He Phe 
50 55 50 

Cys His Asn Asp Tyx Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
^5 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 

120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
"5 150 155 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
1^5 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 

230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

O INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unJcnown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cye His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
^5 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
"5 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala His Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cye 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 

230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
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290 295 300 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TVPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^ 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
" 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Arg 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
"5 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 
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Ala Gly Asn Ser lie Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr lie lie Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SBQ ID NO: 37: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 37: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
S° 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
" ^0 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg val Val Tyr Asn Ser Arg Thr Asp Lys Pro Tip Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 

150 155 160 



wo 95/20657 PCT/DK95/00042 

110 

Asp Asp Phe Gin Phe Val Trp Asn lie Tyx Ala Asn Asn Asp Val Val 

165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala His Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro lie Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^ 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
" ^0 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 
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Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
lis 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 140 ■ 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 ISO 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala His Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro lie Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SBQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
15 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 
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Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
^5 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Glv 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
^'^5 ISO 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Ala Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 

230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

I) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: aunino acid 

( C ) STRANDEDNES S : unknown 

(D) TOPOLOGY: linear 
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(ii) MOLECDZiE TVPE : peptide 

(xi) SKQnBNCB DESCRIPTION: SEQ ID NO: 40: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
^5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
" 70 75 80 

Arg Gly Ser Ala Tyr Gly Asp Val Leu Ser Ser Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 

120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 j^^o 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 

ISO 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
IfiS 170 175 

Val Pro Thr Gly Gly Cys Asp Ala Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
19S 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 

230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 



BN800CID: .^¥0_»6aDB87A1J_> 



wo 95/20657 




PCT/DK95/00042 



114 

275 280 285 

Val Gin Ser lie lie Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA 


CTGCTGATGG 


GCTGGTCGGT 


AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC 


GCTATCCCTA 


TTGGCGGTGG 


CAGCGCCAAT 


120 


GTTTATGTAA 


ACCTTGCGCC 


CGTCGTGAAT 


GTGGGGCAAA 


ACCTGGTCGT 


GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTATCCG 


GAAACCATTA 


CAGACTATGT 


CACACTGCAA 


240 


CGAGGCTCGG 


CTTATGGCGG 


CGTGTTATCT 


AATTTTTCCG 


GGACCGTAAA 


ATATAGTGGC 


300 


AGTAGCTATC 


CATTTCCTAC 


CACCAGCGAA 


ACGCCGCGCG 


TTGTTTATAA 


TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG 


ACGCCTGTGA 


GCAGTGCGGG 


CGGGGTGGCG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT 


ATTTTGCGAC 


AGACCAACAA 


CTATAACAGC 


480 


GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 


ATGTGGTGGT 


GCCTACTGGC 


54 0 


GGCTGCGATG 


TTTCTGCTCG 


TGATGTCACC 


GTTACTCTGC 


CGGACTACCG 


TGGTTCAGTG 


600 


CCAATTCCTC 


TTACCGTTTA 


TTGTGCGAAA AGCCAAAACC 


TGGGGTATTA 


CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CGTCGTTTTC 


ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT 


ACGATTATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTGAGT 


CTGGGATTAA 


CGGCAAATTA TGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


GCGTGACTTT 


TGTTTATCAA 


900 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECUIiE TYPE: DNA (gencamic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCAATCCCTA TTGGCGGTGG CAGCGCCAAT 
GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAH ACCTGGTCGT AGATCTTTCG 
ACGCAAATCT TTTGCCATAA CGATTACCCA GAAACCATTA CAGACTATCT CACACTGCAA 
CGAGGTTCGG CTTATGGCGG CGTGTTATCT AGlTnTCCG GGACCGTAAA ATATAATGGC 
AGTAGCTATC CTTTCCCTAC TACCAGCGAA ACGCCGCGGG TTGrTTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCGGTGA GCAGTX3CGGG GGGAGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTITGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTrGTGTG GAATATTTAC GCCAATAATG ATCTGGTGGT GCCCACTCGC 
GGCTGCGATG TTTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 
CCGATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTAITA CCTCTCCGGC 
ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCCGCGCAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CrGGGATTAA CGGCAAATTA CGCACGTACC 
GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACnT TGTTTATCAA 

(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) JK>IiECDLB TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ATGAAACGAG TTATTACCCT GTTrGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCAATCCCTA TTCGCGGTGG CAGCGCCAAT 
GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 
ACGCAAATCT TTTGCCATAA CGATTACCCA GAAACCATTA CAGACTATGT CACACTGCAA 
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CGAGGTTCGG CTTATGGCGG CGTGTTATCT AGTTTTTCCG GGACCGTAAA ATATAATGGC 
AGTAGCTATC CTTrCCCTAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTArTTG ACGCCGGTGA GCAGTGCGGG GGGAGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTITGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 
GGCTGTGATG CTTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 
CCGATTCCTC TTACCGTITA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTATCCGGC 
ACACATGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTITTC ACCCGCGCAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTGAGT CTGGGATTAA CGGCAAATTA TGCACGTACC 
GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATCCCTA TTGGCGGTGG CAGCGCCAAT 
GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT GGATCTTrCG 
ACGCAAATCT TTTGCCATAA CGATTACCCG GAAACCATTA CAGACTATGT CACACTGCAA 
CGAGGTTCGG CTTATGGCGG CGTGTTATCT AGTmrCCG AGACCGTAAA ATATAATGGC 
AGTAGCTATC CTTTCCCTAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTATrTG ACGCCTGTGA GCAGTGCGGG GGGAGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 
GGCTGTGATG TTTCTGCTCG TGATGTCACC GTTACTrTGC CGGACTACCC TGGTTCAGTG 
CCGATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 
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ACAGACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGnTTC ACCTGCACAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 84 0> 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTr TGTTTATCAA 900/^ 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECOUS TYPE: DNA (genomic) 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATGAAACGAG TTATTACCCT GTTrGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 60 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCAATCCCTA TTGGCGGTGG CAGCGCCAAT 12 0 

GTTTATGTAA ACCTTCCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 
ACGCAAATCT TTTGCCATAA CGATTACCCA GAAACCATTA CAGACTATGT CACACTGCAA 24 0 

CGAGGTTCGG CTTATGGCGA CGTGTTATCT AGTmTCCG GGACCGTAAA ATATAATGGC 300 
AGTAGCTATC CTTTCCCTAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 
GATAAGCCGT GGCCGGTGGC GCTTTATITG ACGCCGGTGA GCAGTGCGGG GGGAGTGGCG 420 
ATTAAAGCTG GCTCATTAAT TGCCGTGCIT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 
GGCTGTGATG TCTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TCGTTCAGTG 600 
CCGATTCCTC TrACCGTITA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTATCCGGC 660 
ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCCGCGCAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 
GGAGGGCAGG TGACCGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 



480 
540 



720 
780 
840 
900 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) lOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 46: 

ATC5AAACGAG TTATTACCCT GTITGCTGTA CTGCTGATGG GCTOGTCGGT AAATGCCTGG 60 

TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATTCCTA TTGX3CGGTGG CAGCGCTAAT 120 

GTITATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 

ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAOACTATGT CACACTGCAA 240 

CGAGGCTCGG CTTATGGCGG CGTGTTATCT AATrrTTCCG GGACC6TAAA ATATAGTGGC 300 

AGTAGCTATC CATTCCCGAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 

GATAAGCCGT GGCCGGTGGC GCTTTATTrG ACGCCTGTGA GCAGTGCGGQ TGGGGTGQCG 420 

ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 480 

GATGATTTCC AGTrTGTGTG GAATATTTAC GCCAATAATXS ATtSTGGTGGT GCCTACTGGC 540 

GGCTGCGATG TTTCTGCTCA TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 600 

CCAATTCCTC TTACCGTITA TTGTGCGAAA AGCCAAAACC TGGGGTA1TA CCTCTCCGGC 660 

ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTITTC ACCAGCGCAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 840 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTIT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
ATGAAACGAG TTATTACCCT GriTGCrGTA CTGCTGATGG GCTCGTCGGT AAAT^CCTCG 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATTCCTA TTGGCGGTGG CAGCGCTAAT 



60 
120 
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GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 

ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 24 0 

CGAGGCTCGG CTTATGGCGG CGTGTTATCT AATITrrCCG GGACCGTAAA ATATAGTGGC 300 

A6TAGCTATC CATTTCCGAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 

GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG TGGGGTGGCG 420 

ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 480 

GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCTACTGGC 540 

GGCTGCGATG TTTCTGCTCA TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 600 

CCAATTCCTC TTACCGTITA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 660 

ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTITrC ACCAGCGCAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CrGGGATTAA CGGCAAATTA CGCACGTACC 840 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATrG GCGTGACTTT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO:48: * 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDBDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC GCTATCCCTA TTGGCGGTGG CAGCGCCAAT 


120 


GTTTATGTAA ACCTTGCGCC 


CGCCGTGAAT GTGGGGCAAA ACCTGGTCGT GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 


240 


CGAGGCTCGG 


CTTATGGCGG 


CGTGTTATCT AATTTTTCCG GGACCGTAAA ATATAGTGGC 


300 


AGTAGCTATC 


CATTTCCTAC 


CACCAGCGAA ACGCCGCGCG TTGTTTATAA TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG ACGCCTGTGA GCAGTGCGGG TAAAGCTGGC 


420 


TCATTAATTG 


CCGTGCTTAT 


TTTGCGACAG ACCAACAACT ATAACAGCGA TGATTTCCAG 


480 


TTTGTGTGGA 


ATATTTACGC 


CAATAATGAT GTGGTGGTGC CTACTGGCGG CTGCGATGTT 


540 
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TCTGCTCGGG ATGTCACCGT TACTCTGCCG GACTACCCTG GTTCAGTGCC AATTCCTCTT 600 

ACCGTTTATT GTGCGAAAAG CCAAAACCTG GGGTATTACC TCTCCGGCAC ACACGCAGAT 660 

GCGGGCAACT CGATTITCAC CAATACCGCG TCGTITTCAC CTGCACAGGG CGTCGGCGTA 720 

CAGTTGACGC GCAACGGTAC GATTATTCCA GCGAATAACA CGGTATCGTT AGGAGCAGTA 780 

GGGACTTCGG CGGTGAGTTT GGGATTAACG GCAAATTATG CACGTACCGG AGGGCAGGTG 84 0 

ACTGCAGGGA ATGTGCAATC GATTATrGGC GTGACnTTG TTTATCAA 888 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTCGTCGGT AAATGCCTGG 60 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATCCCTA TTGGCGGTGG CAGCGCCAAT 
GTTTATGTAA ACCTTGCGCC CGTCGTGAAT GTGGGGCAAA ACCTGGTCGT GGATCTTTCG 
ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 
CGAGGCTCGG CTTATGGCGG CGTGTTATCT AATTTITCCG GGACCGTAAA ATATAGTGGC 
AGTAGCTATC CATTTCCTAC CACCAGCGAA ACGCCGCGCG TTGriTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTATrrG ACGCCTGTGA GCAGTGCGGG CGGGGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCTACTGGC 
GGCTGCGATG TTTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 
CCAATTCCTC TTACCGTTTA TTCTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 
ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTITC ACCTCCACAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGAATATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTGAGT CTGGGATTAA CGGCAAATTA TGCACGTACC 
GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTAITC GCGTOACTTT TGTTTATCAA 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 



(2) INFORMATION FOR SEQ ID N0:50: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LBKGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJNDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



ATGAAACGAG 


TTATTAACCT 


GTTTGCTGTA 


CTGCTGATGG 


GCTGGTCGGT 


AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGCACC 


GCTATCCCTA 


TTGGCGGTGG 


CAGCGCCAAT 


120 


GTTTATGTAA 


ACCTTGCGCC 


CGCCGTGAAT 


GTGGGGCAAA 


ACCTGGTCGT 


GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTACCCG 


GAAACCATTA 


CAGATTATGT 


CACACTGCAA 


240 


CGAGGCTCGG 


CTTATGGCGG 


CGTGTTATCT AA'rmTCCG 


GGACCGTAAA ATATAGTGGC 


■J w 


AGTAGCTATC 


CATTTCCGAC 


CACCAGTGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG 


ACGCCTGTGA 


GCAGTGCGGG 


CGGGGTGGTG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT ATTTTGCGAC AGACCAACAA 


CTATAACAGC 


480 


GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 


ATGTGGTGGT 


GCCCACTGGC 


540 


GGCTGCGATG 


TTTCTGCTCG 


TGATGTCACC 


GTTACTCTGC 


CGGACTACCC 


TGGTTCAGTG 


600 


CCGATTCCTC 


TTACCGTTTA 


TTGTGCGAAA 


AGCCAAAACC 


TGGGGTATTA 


CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CGTCGTTTTC 


ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT 


ACGATTATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTAAGT 


CTGGGATTAA 


CGGCAAATTA 


CGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


CCGTGACTTT 


TGTTTATCAA 


900 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
ATGAAACGAG TTATTAACCT GTTTGCTGTA CTGCTGATGG GCrTGGTCGGT AAATGCCTGG 
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TCATTCGCCT 


GTAAAACCGC 


CAATGrGCACC 








15 0 
u 


GTTTATGTAA ACCTTGCGCC 


CGCCGTGAAT 








X o u 


ACGCAAATCT 


TTTGCCATAA 


CGATTACCCG 


GAUICCATTA 


Xm'^U'^^ X^%X ^7 A, 




5 A n 


CGAGGTTCGG 


CTrATGGCGG 


CGTGTTATCT 


AATTTTTCCG 


GGACCGTAAA 


ATATAGTYSGC 


J w V 


AGTAGCTATC 


CATTTCCTAC 


CACCAGCGAA 


ACGCTGCGGG 


TTGTTTATAA 




-5 o 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTAT^HG 


ACGCCTGTC3A 






A o o 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT 


ATTTTGCGAC 








GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 




uUCTACTGGC 


540 


GGCTGCGATG 


TTTCTGCTCG 


TGATGTCACC 


GTTACTCTGC 




rUvvTTCAQTG 


600 


CCAATTCCTC 


TTACCGTTTA 


TTGTGCGAAA AGCCAAAACC 




CCTCTC CGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CCTCGTTTTC 


ACCAGCGCAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT ACX3A1TATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTAAGT 


CTGGGATTAA 


CGGCAAATTA 


CGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


GCGTGACTTT 


TGTTTATCAA 


900 



(2) INFORMATION FOR SBQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOLECDLB TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 60 

TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATCCCTA TTGGCGGTGG CAGCGCTAAT 120 

GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 

ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 240 

CGAGGCTCGG CTTATGGCGG CGTGTTATCT AATTTTrCCG GGACCGTAAA ATATAGTGGC 300 

AGTAGCTATC CATTTCCGAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 

GATAAGCCGT GGCCGGTGGC GCnTATTTG ACGCCTGTGA GCAGTGCGGG TGGGGTGGCG 420 

ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 480 
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GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCTACTGGC 540 

GGCTGCGATG nTCTGCTCA TGATGTCACC GTTACTCTGC CGGACTACCC TCGTTCAGTG 600 

CCAATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 660 

ACACACGCAG ATGCGGGCAA CTCGATnTC ACCAATACCG CGTCGTTTTC ACCAGCGCAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 840 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACITT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA 


CTGCTGATGG 


GCTGGTCGGT AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC 


GCTATCCCTA 


TTGGCGGTGG CAGCGCCAAT 


120 


GTTTATGTAA ACCTTGCGCC 


CGCCGTGAAT 


GTGGGGCAAA ACCTGGTCGT GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTATCCG 


GAAACCATTA 


CAGACTATGT CACACTGCAA 


240 


CGAGGCTCGG 


CTTATGGCGG 


CGTGTTATCT 


AAlVi'lTCCG 


GGACCGTAAA ATATAGTGGC 


300 


AGTAGCTATC 


CATTTCCTAC 


CACCAGCGAA ACGCCGCGCG 


TTGTTTATAA TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG 


ACGCCTGTGA 


GCAGTGCGGG CGGGGTGGCG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT 


ATTTTGCGAC 


AGACCAACAA CTATAACAGC 


480 


GATGATTTCC AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 


ATGTGGTGGT GCCTACTGGC 


540 


GGCTGCGATG 


TTTCTGCTCG 


TGATGTCACC 


GTTACTCTGC 


CGGACTACCC AGGTTCAGTG 


600 


CCAATTCCTC 


TTACCGTTTA 


TTGTGCGAAA AGCCAAAACC 


TGGGGTATTA CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CGTCGTTTTC ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT 


ACGATTATTC 


CAGCGAATAA CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTGAGT 


CTGGGATTAA 


CGGCAAATTA TGCACGTACC 


84 0 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


GCGTGACTTT TGTTTATCAA 


900 
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(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOIiECDLE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA 


CTGCTGATGG 


GCTGGTCGGT 


AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC 


GCTATCCCTA 


TTGGCGGTGG 


CAGCGCCAAT 


120 


GTTTATGTAA ACCTTGCGCC 


TGCCGTGAAT 


GTGGGGCAAA ACCTGGTCGT 


GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTACCCG 


GAAACCATTA 


CAGACTATGT 


CACACTGCAA 


240 


CGAGGTTCGG 


CTTATGGCGG 


CGTGTTATCT 


AGTTTTTCCG 


GGACCGTAAA ATATAATGGC 


300 


AGTAGCTATC 


CTTTCCCTAC 


TACCAGCGAA ACGCCGCGCG 


TTGriTATAA 


TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG 


ACGCCTGTGA 


GCAGTGCGGG 


GGGAGTGGCG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT 


ATTTTGCGAC 


AGACCAACAA 


CTATAACAGC 


480 


GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 


ATGTGGTGGT 


GCCCACTGGC 


540 


GGCTGTGATG 


TTTCTGCTTG 


TGATGTCACC 


GTTACTTTGC 


CGGACTACCC 


TGGTTCAGTG 


600 


CCGATTCCTC 


TTACCGTTTA 


TTGTGCGAAA AGCCAAAACC 


TGGGGTATTA 


CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CGTCGTTTTC 


ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT 


ACGATTATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTAAGT 


CTGGGATTAA 


CGGCAAATTA 


CGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


GCGTGACTTT 


TGTTTATCAA 


900 


(2) INFORMATION FOR SEQ ID NO: 55: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SBQUKNCE DESCRIPTION: SEQ ID N0:55: 

GATCTGTTGA AGTTCCGGGT AGTCAGCATA TCGATAGTCA GAAAAAAGCT G 51 
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CLAIMS 



1. A method of targeting a bacterial adhesin to a specific 
location, comprising (i) identifying in said location adhes- 
in- interacting receptor moiety which is recognizable by 
bacterial adhesins, (ii) isolating a bacterial cell that 
grows in said location and expresses an adhesin recognizing 
and interacting with said receptor moiety, and administering 
to the location the bacterial cell or the adhesin under 
conditions where the adhesin and the receptor moiety are 
brought into interacting contact whereby the adhesin is 
associated with the receptor moiety. 

2. A method according to claim 1 wherein the receptor moiety 
is selected from the group consisting of a glycolipid, a 
glycoprotein, a protein, a polypeptide, a saccharide moiety 
and a peptide. 

3. A method according to claim 1 wherein the isolated bacte- 
rial cell expresses an adhesin having modified receptor 
moiety- binding properties relative to an adhesin natively 
expressed by the cell, the isolation of the cell comprising 
identifying in a parent bacterial cell, DNA sequence (s) 
coding for the binding domain (s) of said natively expressed 
adhesin and substituting at least one codon herein, whereby a 
modified adhesin molecule is expressed that is different in 
at least one amino acid from the adhesin expressed natively, 
and selecting a the bacterial cell expressing the modified ' 
adhesin having an altered adhesion phenotype relative to the 
natively expressed bacterial adhesin. 

4 . A method according to claim l wherein the bacterial cell 
expressing an adhesin that recognizes and binds to the 
receptor moiety is a recombinant bacterial cell derived from 
a parent bacterial cell that does not produce an adhesin 
binding to said receptor, by inserting into the parent cell a 
DNA sequence coding for an adhesin binding to the receptor 
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moiety, and selecting a bacterial cell expressing the DNA 
sequence . 



5. A method according to claim 1 wherein a non-adhesin com- 
pound is associated with the adhesin, whereby said compound 

5 is targeted with the adhesin to the location comprising the 
receptor moiety recognizable by the adhesin. 

6. A method according to claim 5 wherein the con^jound is 
covalently bound to the adhesin. 



10 



7. A method according to claim 6 wherein the compound is part 
of a fusion protein con^rising the adhesin, the compound 
being selected from the group consisting of an enzyme, an 
antibody, an epitope and a toxin. 

8. A method according to claim 5 wherein the cortpound is one 
associated with the adhesin by a non-covalent binding. 

15 9. A method according to claim 8 wherein the conpound is 
selected from the group consisting of a pharmacologically 
active, a diagnostically active and an imaging compound. 

10. A method according to claim 1 wherein the specific loca- 
tion is a human or animal surface. 

20 11. A method according to claim 1 wherein the specific loca- 
tion is a plant surface. 

12. A method according to claim 1 wherein the bacterial cell 
expresses a recombinant bacterial adhesin variant derived 
from a naturally occurring parent adhesin, said recombinant 
25 bacterial adhesin variant having altered binding properties 
relative to the naturally occurring adhesin from which it is 
derived, the altered binding properties including binding to 
at least one receptor moiety to which the parent adhesin does 
not bind. 
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13. A method according to claim 12 wherein the adhesin vari- 
ant is derived from a naturally occurring adhesin isolated 
from a cell structure selected from the group consisting of a 
capsule, a lipopolysaccharide layer, an outer membrane pro- 

5 tein, a flagellum, a pilus, a fimbria, a non-fimbrial adhesin 
(NFA) and an af imbrial adhesin (AFA) . 

14. A method according to claim 12 or 13 wherein the adhesin 
variant is a protein having an amino acid sequence differing 
in at least one amino acid residue from its parent protein 

10 adhesin. 

15. A method according to claim 14 wherein the adhesin vari- 
ant is a FimH adhesin having an amino acid sequence which 
differs from the E. coll PC31 FimH adhesin as defined in 
Table 1 herein in at least one amino acid. 

15 16. A method according to claim 15 wherein the FimH adhesin 
is one binding to a receptor selected from the group consist- 
ing of a domain where mannosyl residues are not terminal and 
a domain devoid of saccharide. 



20 



17. A method according to claim 15 wherein the adhesin vari- 
ant is a chimeric adhesin comprising amino acid sequences 
from different FimH adhesins. 



18. A method according to claim 15 wherein the FimH adhesin 
has an amino acid sequence which is selected from the group 
consisting of sequences appearing in Fig. 5 herein with 

25 designations CI#12, CI#4, CI#7 or CSH-50. 

19. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to yeast mannan (Mn) , human 
plasma f ibronectin (Fn) , periodate treated Fn and the syn- 
thetic peptide FnSpl comprising the first 30 amino acids of 

30 Fn, only binds to Mn (M class) . 
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20. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to yeast mannan (Mn) , human 
plasma fibronectin (Fn) , periodate treated Pn and the syn- 
thetic peptide FnSpl comprising the first 30 amino acids of 
Pn, binds to Mn and Fn (MP class) • 



21. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to yeast mannan (Mn) , human 
plasma fibronectin (Fn) , periodate treated Pn and the syn- 
thetic peptide FnSpl comprising the first 30 amino acid 

10 residues of Fn, binds to all of these (MFP class). 

22. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to five Fn- fragments obtained 
by thermolysin treatment, only binds to the 40-JcDa gelatin- 
binding fragment. 

15 23. A method according to claim 22 wherein the adhesin is/ one 
which, when tested for binding to five Fn- fragments obtained 
by thermolysin treatment, binds to all five fragments. 

24. A method according to claim 15 wherein the adhesin is at 
least 90% homologous to the PC31 PimH adhesin. 

20 25. A method according to claim 15 wherein the adhesin is a 
chimeric adhesin comprising amino acid secjuences from differ- 
ent PimH adhesins. 



26. A method according to claim 15 comprising an amino acid 
sequence which differs from the E. coll PC31 FimH adhesin by 
25 at least one amino acid occurring between residues 27 and 119 
of the mature PimH sequence. 



27, A method according to claim 15 wherein the adhesin binds 
to a receptor moiety selected from the group consisting of a 
human receptor moiety, an animal receptor moiety, a plant 
30 receptor moiety and an inanimate, non- biological receptor 
moiety. 
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28. A method according to claim l wherein the bacterial cell 
being targeted is a cell comprising a gene coding for a gene ' 
product which, when expressed has a killing or cell function- 
limiting effect in said cell, the expression of said gene 
coding for the cell killing or cell fiinction- limiting gene 
product being regulated in such a manner that the bacterial 
cell when targeted will be killed or limited in its function 
in a pre -determined manner. 



29. A recombinant or mutant bacterial adhesin variant derived 
10 from a naturally occurring parent adhesin, said adhesin 
variant having altered binding properties relative to the 
naturally occurring adhesin from which it is derived, the 
altered binding properties including binding to at least one 
receptor to which the parent adhesin does not bind. 

15 30. An adhesin variant according to claim 29 which is derived 
from a naturally occurring adhesin isolated from a cell 
structure selected from the group consisting of a capsule, a 
lipopolysaccharide layer, an outer membrane protein, a 
flagellum, a pilus, a fimbria, a non-fimbrial adhesin (NFA) 

20 and an afimbrial adhesin (AFA) . 

31. An adhesin variant according to claim 29 or 30 which is a 
protein having an amino acid sequence differing by at least 
one amino acid residue from its parent protein adhesin. 



25 



32 . An adhesin variant according to claim 29 which is a FimH 
mannose-sensitive adhesin having an amino acid sequence which 
differs from the E. coli PC31 FimH adhesin as defined in 
Table 1 herein by at least one amino acid, said FimH adhesin 
binding to a receptor selected from the group consisting of a 
domain where mannosyl residues are not terminal and a domain 
30 devoid of saccharide. 



33. An adhesin variant according to claim 32 which is at 
least 90% homologous to the PC31 FimH adhesin. 
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34. An adhesin variant according to claim 32 which is a 
chimeric adhesin comprising amino acid sequences from differ- 
ent FimH adhesins, 

35. An adhesin variant according to claim 29 which binds to a 
5 receptor moiety selected from the group consisting of an 

animal receptor moiety, a plant receptor moiety and an inani- 
mate receptor moiety. 

36. An adhesin variant according to claim 29 which is part of 
a fusion protein comprising the adhesin variant and a 

10 heterologous polypeptide. 

37 An adhesin variant according to claim 36 wherein the 
heterologous polypeptide is selected from the group consist- 
ing of an epitope, an enzyme, a toxic gene product and an 
antibody . 

15 38. A FimH adhesin comprising 279 amino acids, having an 

amino acid sequence which differs from the E. cold. PC31 FimH 
adhesin as defined in Table 1 herein by at least one amino 
acid. 



39 . A FimH adhesin according to 
20 acid sequence which is selected 
appearing in Fig. 5 herein with 
CI#7 or CSH-50. 



claim 38 which has an amino 
from the group of sequences 
designations CI#12, CI#4, 



40. An adhesin according to claim 38 which binds to a 
receptor selected from the group consisting of a domain where 
raannosyl residues are not terminal, a domain devoid of 
saccharide, a glycolipid, a glycoprotein, a protein, a 
polypeptide and a peptide. 



41. An adhesin according to claim 3 8 which when tested for 
30 binding to yeast mannan (Mn) , human plasma fibronectin (Fn) , 
periodate treated Fn and the synthetic peptide FnSpl compris 
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ing the first 30 amino acids of Fn, only binds to Mn (M class) . 

42. An adhesin according to claim 38 which when tested for 

binding to yeast mannan (Mn) , hviman plasma fibronectin (Fn) , 

periodate treated Fn and the synthetic peptide FnSpl compris- 

5 ing the first 3 0 amino acids of Fn, binds to Mn and Fn (MF 
class) . 

43. An adhesin according to claim 38 which when tested for 
binding to yeast mannan (Mn) , human plasma fibronectin (Fn) , 
periodate treated Fn and the synthetic peptide FnSpl compris- 
ing the first 30 amino acid residues of Fn, binds to all of 
these (MFP class) . 

44. An adhesin according to claim 38 which when tested for 
binding to five Fn- fragments obtained by thermolysin treat- 
ment, only binds to the 40-kDa gelatin-binding fragment. 

45. An adhesin according to claim 38 which when tested for 
binding to five Fn- fragments obtained by thermolysin treat- 
ment, binds to all five fragments. 

46. An adhesin according to claim 38 which is at least 90% 
homologous to the PC31 FimH adhesin. 

47. An adhesin according to claim 38 which is a chimeric 
adhesin comprising amino acid sequences from different FimH 
adhesins . 

48. An adhesin according to claim 38 comprising an amino acid 
sequence which differs from the E. coli PC31 FimH adhesin by 
at least one amino acid occurring between residues 27 and 119 
of the mature FimH sequence. 



49. An adhesin according to claim 48 which binds to a 
receptor moiety selected from the group consisting of a human 
receptor moiety, an animal receptor moiety and a plant 
3 0 receptor moiety. 
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50. A recombinant replicon comprising a DNA sequence selected 
from the group consisting of a secjuence coding for a recombi- 
nant bacterial adhesin as defined in claim 29 and a sequence 
coding for a FimH adhesin as defined in claim 38. / 

5 51. A recombinant replicon according to claim 50 wherein the 
DNA secjuence codes for a FimH adhesin having an amino acid 
sequence which differs from the E. coli PC31 FimH adhesin by 
at least one amino acid. 

52. A replicon according to claim 52 in which the DNA 

10 sequence is at least 90% homologous to the PC31 flwH gene. 

53. A replicon according to claim 50 in which the DNA 
sequence is a chimeric flmH gene con^srising DNA from differ- 
ent fimH genes. 

54. A replicon according to claim 50 in which the DNA 

15 sequence comprises a DNA sequence which differs from the E. 
coli PC31 fimH gene by at least one codon between the codons 
coding for amino acid residues 27 and 119 of the mature FimH 
sequence, 

55. A replicon according to claim 50 in which the DNA 
2 0 sequence comprises a further DNA sequence coding for a 

heterologous polypeptide • 

56. A replicon according to claim 55 wherein the polypeptide 
is selected from a group consisting of an epitope, an enzyme, 
a toxic gene product and an antibody. 

25 57. A replicon according to claim 55 wherein the further DNA 
sequence codes for a gene product which is selected from a 
pesticidally active gene product and a pollutant -degrading 
gene product. 

58. A replicon according to claim 50 wherein the DNA sequence 
30 is isolated from an Eiiterobacteriaceae species. 



10 
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59. A fusion protein corr^jrising an adhesin selected from the 
group consisting of a recombinant bacterial adhesin variant 
as defined in claim 29 and a FimH adhesin as defined in claim 
38, and a heterologous polypeptide. 

60. A fusion protein according to claim 59 wherein the 
heterologous polypeptide is selected from an epitope, an 
enzyme, a toxic gene product and an antibody. 

61. A fusion protein according to claim 59 which carries a 
non-covalently bound coit^jound. 

62. A bacterial cell which expresses an adhesin selected from 
the group consisting of a recombinant bacterial adhesin 
variant as defined in claim 29 and a FimH adhesin as defined 
in claim 38. 



63. A recombinant bacterial cell according to claim 62 which 
15 comprises a recombinant replicon as defined in claim 50. 

64. A bacterial cell according to claim 62 which is selected 
from Enterobacteriaceae, Pseudomon^daceae , Vibrionaceae and 
Baccilaceae . 



20 



25 



65. A bacterial cell according to claim 62 which further 
expresses a gene product selected from the group consisting 
of a pesticidally active compound, an immunologically active 
gene product and a pollutant -degrading active compound. 

66. A bacterial cell according to claim 62 in which the 
recombinant adhesin variant is expressed as a fusion protein 
comprising the adhesin variant and a further polypeptide. 



67. A bacterial cell according to claim 62 which further 
comprises a gene coding for a gene product which, when 
expressed has a killing or cell function- limiting effect i 
said cell, the expression of said gene coding for the cell 
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killing or cell function- limiting gene product being regu- 
lated in such a manner that the bacterial cell when targeted 
to receptor in a specific location will be killed or limited 
in its function in a pre -determined manner. 



5 68. A method of isolating a bacterial cell expressing an 
adhesin having modified binding properties relative to a 
natively expressed adhesin, comprising identifying in the 
bacterial cell DNA sequence (s) coding for the binding 
domain (s) of said natively expressed adhesin and substituting 

10 at least one codon herein, whereby a modified adhesin mole- 
cule is expressed that is different in at least one amino 
acid from the adhesin expressed natively, and selecting a 
bacterial cell expressing the modified adhesin having an 
altered adhesion phenotype relative the natively expressed 

15 bacterial adhesin. 



69. A method according to claim 68 wherein a non-adhesin 
compound is associated with the adhesin. 

70. A method according to claim 69 wherein the non-adhesin 
compound is associated with the adhesin by being expressed 
with the adhesin as part of a fusion protein comprising the 
adhesin. 



71, A method according to claim 68 which in a further step 
comprises binding non-covalently a compound to the adhesin 
when expressed. 

72, A method according to claim 68 wherein the natively 
expressed adhesin is a FimH adhesin. 

73. A method according to claim 68 wherein the codon (s) 
is/are substituted by mutagenization. 

74. A method of preparing a recombinant bacterial cell that 
binds to a specific receptor moiety, comprising introducing 
into a bacterium that does not produce an adhesin binding to 
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said receptor moiety, a DNA sequence coding for an adhesin 
binding to the receptor moiety, and selecting a bacterial 
cell expressing the DNA sequence . 

75. A method according to claim 74 wherein the DNA sequence 
5 coding for an adhesin binding to the receptor moiety is a 

sequence coding for a FimH adhesin. 

76. A method according to claim 74 wherein the DNA is intro- 
duced by transforming the bacterial cell with a recombinant 
replicon as defined in claim 50. 

10 77. A method according to claim 74 wherein a non-adhesin 
compound is associated with the adhesin. 

78. A method according to claim 77 wherein the non-adhesin 
compound is associated with the adhesin by being expressed 
with the adhesin as part of a fusion protein comprising the 

15 adhesin. 

79. A method according to claim 74 which in a further step 
comprises binding non-covalently a compound to the adhesin 
when expressed. 



20 
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80. A method of providing a mutant bacterial cell having 
fimbriae which binds to a moiety to which the wild- type cell 
from which the mutant cell is derived does not bind, compris- 
ing contacting a population of said wild- type cell with said 
moiety, removing the contacted cells which do not bind to the 
moiety, cultivating cells binding to the moiety to obtain a 
culture which is enriched with regard to cells binding to the 
moiety and selecting from said culture a mutant cell binding 
to said moiety. 



81. A method according to claim 80 wherein the moiety with 
which the wild- type cell population is contacted, is a 
3 0 casein. 
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82. A method of isolating a compound from a solution or 
suspension containing the compound, the method comprising 
contacting the solution or the suspension with a fusion . 
protein according to claim 59 wherein the heterologous 
5 polypeptide has an affinity to the conpound to be isolated 



83. A coii5)osition comprising a population of a bacterial cell 
as defined in claim 62. 
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